Docket He. 58044-A/JPW/EMW 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE £ 1 



Honorable Assistant Commissioner for Patents 
Washington, D.C. 20231 



Transmitted herewith for filing are the specification and claims of the 
patent application of: 

Riccardo Dalla-Favera f or 

inventor (S) isolation of five novel genes CODING FOR NEW Fc receptors-type melanoma 

I NVOLVED IN THE PATHOGENESIS OF LYMPHOMA/MYELOMA 

Title of Invention 

Also enclosed are: 

X 34 sheet (s) of informal x formal drawings. 

X oath or declaration of Applicant (s) . (unsigned) 

x A power of attorney 
An assignment of the invention to 



x A Preliminary Amendment 

y Applicant (s) hereby establish small entity status under 37 C.F.R. §1.9 and §1.27. 

The filing fee is calculated as follows: 

CLAIMS AS FILED. LESS ANY CLAIMS CANCELLED BY AMENDMENT 















RATE 




F 






NUMBER 
FILED 




NUMBER 
EXTRA* 




SMALL 
ENTITY 


OTHER 
ENTITY 




SMALL 
ENTITY 


OTHER 
ENTITY 


Total Claims 


33-20 




13 


X 


$ 9.00 


$18.00 




$117.00 


$ 


Independent 
Claims 


7-3 




4 


X 


$40.00 


$80.00 




$160.00 


$ 


Multiple Dependent 
Claims Presented: 


Yes x 


No 


$135.00 


$270.00 




$ 


$ 


*If the different in Col. 


1 is 






BASIC FEE 


$355.00 


$ 710.00 


less than zero, enter M 
Col. 2 


0" 


in 






TOTAL FEE 


$632.00 


$ 



, Riccardo Dalla-Favera 

' U.S. Serial No.: Not Yet Known 

Filed: Herewith November 28, 2000 

Letter of Transmittal 
Page 2 



Y A check in the amount of S 632.00 to cover the filing fee. 

Please charge Deposit Account No. in the amount of 

$ . 

X The Commissioner is hereby authorized to charge any additional fees 
which may be required in connection with the following or credit any 
over -payment to Account No. 03-3125 ; 

X Filing fees under 37 C.F.R. SI- 16. 

X Patent application processing fees under 37 C.F.R. SI. 17. 

The issue fee set in 37 C.F.R. SI* 18 at or before mailing of the 

Notice of Allowance, pursuant to 37 C.F.R. 51.311(b). 

X Three copies of this sheet are enclosed. 

A certified copy of previously filed foreign application No. 

filed in on 



Applicant (s) hereby claim priority based upon this aforementioned 
; foreign application under 35 U.S.C. $119. 

Other (identify) Express Mail Certif icate of Mailing bearing Label No. 

EK 166 982 810 PS dated November 28, 2000, and a loose 
set of Formal Drawings (34 sheets). 



Respectfully submitted, 

Johik P . \ White 
Registration No. 28,678 
Attorney for Applicants 
Cooper Sc Dunham LLP 
118 5 Avenue of the Americas 
New York, New York 10036 
(212) 278-0400 



Dkt. 58 044-A/JPW/EMW 
IN THE UNITED ST ATES PATENT AND TRADEMARK OFFICE 

Applicant : Riccardo Dalla-Favera 

Serial No. : Not Yet Known 

Filed : Herewith 

For : ISOLATION OF FIVE NOVEL GENES CODING FOR NEW 

Fc RECEPTORS -TYPE MOLECULES INVOLVED IN THE 
PATHOGENESIS OF LYMPHOMA/MYELOMA 

118 5 Avenue of the Americas 
New York, New York 10 036 
November 28, 2 000 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Box: Patent Application 

PRELIMINARY AMENDMENT 

Please amend the above- identified application as follows: 

In the claims : 

Please cancel claims 26-32 and 48-68 without disclaimer or 
prejudice to applicant's right to pursue the subject matter of 
these claims in a future continuation or divisional application. 

REMARKS 

Applicant has hereinabove canceled claims 26-32 and 48-68 without 
prejudice to applicant's right to pursue the subject matter of 
these claims in a continuation or divisional application at a 
later date. Accordingly, upon entry of the amendment claims 1-25 
and 33-47 will be pending and under examination. Accordingly, 



Riccardo Dalla-Favera 

U.S. Serial No.: Not Yet Known 

Filed: Herewith 

Page 2 

applicant respectfully requests entry of the Amendment. 

If a telephone interview would be of assistance in advancing 
prosecution of the subject application, applicant's undersigned 
attorney invites the Examiner to telephone him at the number 
provided below. 

No fee is deemed necessary in connection with the filing of this 
Preliminary Amendment. However, if any fee is required 
authorization is hereby given to charge the amount of such fee 
to Deposit Account No. 03-3125. 



Respectfully submitted, 




Attorney for Applicant 
Cooper & Dunham LLP 
1185 Avenue of the Americas 
New York, New York 10 036 
(212) 278-0400 



Dkt. 58044-A/JPW/EMW 



to 

Unitei* Stairs fatter* ?atntt 



So ail mham tt mag ronrrrn: 

Be it fenotwi that i, Riccardo Dalla-Favera 

have invented certain new and useful improvements in 

ISOLATION OF FIVE NOVEL GENES CODING FOR NEW Fc RECEPTORS— TYPE MELANOMA 
INVOLVED IN THE PATHOGENESIS OF LYMPHOMA/ MYELOMA 

of which the following is a full, clear and exact description. 
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ISOLATION" OF FIVE NOVEL GENES 
CODING FOR NEW Fc RECEPTORS -TYPE MELANOMA 
INVOLVED IN THE PAT HOGENESIS OF LYMPHOMA /MELANOMA 

5 This application claims benefit of copending U.S. 

Provisional Application Serial No. 60/168,151, filed 
November 29, 1999, the contents of which are hereby 
incorporated by reference into the present application. 

10 The invention disclosed was herein made in the course of 

work under NCI Grant No. CA 44029 from the National Cancer 
Institute. Accordingly, the U.S. Government has certain 
rights in this invention. 

Tp Throughout this application, various references are 

referred to in parentheses. Disclosures of these 

publication in their entireties are hereby incorporated by 
reference into this application to more fully describe the 
state of the art to which this invention pertains. Full 

2 0 bibliographic citation for these references may be found 

at the end of this application, preceding the claims. 

BACKGROUND OF THE INVENTION 

2 5 Abnormalities of chromosome lq21 are common in B cell 

malignancies, including B cell lymphoma and myeloma, but 
the genes targeted by these aberrations are largely 
unknown. By cloning the breakpoints of a t(l;14) (q21,-q32) 
chromosomal translocation in a myeloma cell line, we have 

30 identified two novel genes, IRTA1 and IRTA2 , encoding cell 

surface receptors with homologies to the Fc and Inhibitory 
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Receptor families. Both genes are normally expressed in 
mature B cells, but with different distributions in 
peripheral lymphoid organs: IRTA1 is expressed in marginal 
zone B cells, while IRTA2 is also expressed in germinal 
5 center centrocytes and in immunoblasts . As the result of 

the t(l;14) translocation, the IRTA1 signal peptide is 
fused to the Immunoglobulin Ca domain to produce a 
chimaeric IRTAl/Ca fusion protein. In Multiple Myeloma 
and Burkitt lymphoma cell lines with lq21 abnormalities, 
10 IRTA2 expression is deregulated. Thus, IRTA1 and IRTA2 

are novel immunoreceptors with a potentially important 
role in B cell development and lymphomagenesis . 

B-cell Non-Hodgkin' s Lymphoma (B-NHL) and Multiple Myeloma 

15 (MM) represent a heterogeneous group of malignancies 

derived from mature B cells with phenotypes corresponding 
to pre-Germinal Center (GC) (mantle cell) , GC (follicular, 
diffuse large cell, Burkitt' s), or post-GC B cells (MM) 
(for review, Gaidano and Dalla-Favera , 1997; Kuppers et 

20 al . , 1999) . Insights into the pathogenesis of these 

malignancies have been gained by the identification of 
recurrent clonal chromosomal abnormalities characteristic 
for specific disease subtypes. The common consequence of 
these translocations is the transcriptional deregulation 

25 of protooncogenes by their juxtaposition to heterologous 

transcriptional regulatory elements located in the partner 
chromosome (Gaidano and Dalla-Favera, 1997) . These 
heterologous transcriptional regulatory elements can be 
derived from the Immunoglobulin (IG) locus or from other 

30 partner chromosomal loci. Examples include MYC in 

t (8 ; 14) (q24 ;q32) in Burkitt ' s lymphoma (BL) (Dalla-Favera 
et al., 1982; Taub et al . , 1982), the CCND1 gene 



deregulated by the t (11; 14) (ql3 ;q32) in mantle cell 
lymphoma (MCL) (Rosenberg et al., 1991) and multiple 
myeloma (MM) (Ronchetti et al . , 1999), BCL2 involved in 
the t (14;18) (q32;q21) in follicular lymphoma (FL) (Bakhshi 
5 et al., 1985), BCL6 in t (3 ; 14 ) (q27 ; q32 ) in diffuse large B 

cell lymphoma (DLCL) (Ye et al . , 1993), as well as FGFR3 
in t (4 ; 14) (pl6 ;q32) (Chesi et al . , 1997), MAF in 
t (14;16) (q32;q23) (Chesi et al . , 1998) and MUM1/IRF4 in 
t (6, -14) (p25;q32) (Iida et al . , 1997) in multiple myeloma 
10 (MM) . The identification of these oncogenes has offered 

valuable insights into the pathogenesis and diagnosis of 
their corresponding malignancies. 

Chromosomal abnormalities involving band Iq21-q23 are 

15 among the most frequent genetic lesions in both B -NHL and 

MM. Among NHL subtypes, translocation breakpoints at 
Iq21-q23, including translocations and duplications, have 
been reported, often as the single chromosomal 
abnormality, in 17-20% of follicular and diffuse large B- 

2 0 cell lymphoma (DLCL) , in 3 9% of marginal - zone B cell 

lymphoma (Offit et al . , 1991; Whang-Peng et al . , 1995; 
Cigudosa et al . , 1999) and in 27-38% of Burkitt lymphoma, 
where they represent the second most common cytogenetic 
abnormality after translocations involving the MYC proto- 

25 oncogene (Berger and Bernheim, 1985; Kornblau et al . , 

1991) . Comparative genome hybridization (CGH) has also 
identified Iq21-q23 as a recurring site for high-level 
amplification in 10% of DLCL cases (Rao et al . , 1998). In 
MM, trisomy of the Iq21-q32 region has been reported in 

30 20-31% of cases (Sawyer et al . , 1995), amplification of 

the lql2-qter region in 80% of cell lines and 40% of 
primary tumors (Avet -Loiseau et al . , 1997), and nonrandom 



unbalanced whole-arm translocations of lq, associated with 
the multiduplication of the adjacent lq21-22 region, were 
found in 23% of patients with abnormal karyotypes (Sawyer 
et al . , 1998) . 

The high frequency of involvement of lq21 structural 
rearrangements in B-cell malignancies suggests that this 
locus may harbor genes critical to the pathogenesis of 
these diseases. Cloning of a t (1 ; 14) (q21;q32 ) in a pre-B 
cell acute lymphoblastic leukemia cell line previously 
identified a novel gene, BCL9 deregulated in this single 
case (Willis et al . , 1998), but not involved in other 
cases. A recent report characterized the t(l;22) (q22;qll) 
in a follicular lymphoma (FL) cell line and found that the 
FCGR2B locus, encoding the low affinity IgG Fc receptor 
FCGRI IB , was targeted in this cell line and in two 
additional FL cases (Callanan et al . , 2000). Finally, the 
MUC1 locus has been identified in proximity of the 
breakpoint of a t (1 ; 14 ) (q21 ; q32) in NHL (Dyomin et al . , 
2000; Gilles et al . , 2000), and MUC1 locus rearrangements 
have been found in 6% of NHL with lq21 abnormalities 
(Dyomin et al . , 2000). These results highlight the 
heterogeneity of the lq21 breakpoints and the need to 
identify additional candidate oncogenes situated in this 
locus, since the large majority of these alterations 
remain unexplained. 

The aim of this study was to further explore the 
architecture of lq21 chromosomal rearrangements in B cell 
malignancy. To that end, we have employed a molecular 
cloning approach of the t(l;14) (q21;q32) present in the 
myeloma cell line FR4 . We have identified two novel genes 



that are differentially targeted by lq21 abnormalities. 
These genes code for five novel members of the 
immunoglobulin receptor family, IRTA1, IRTA2 , IRTA3 , IRTA4 
and IRTA5 (immunoglobulin superfamily Receptor 

Translocation Associated genes 1, 2, 3, 4, and 5), which 
may be important for normal lymphocyte function and B cell 
malignancy. 



SUMMARY OF THE INVENTION 



This invention provides an isolated nucleic acid molecule 
which encodes immunoglobulin receptor, Immunoglobulin 
5 superfamily Receptor Translocation Associated, IRTA, 

protein . 

This invention provides a method of producing an IRTA 
polypeptide (protein) which comprises: (a) introducing a 
10 vector comprising an isolated nucleic acid which encodes 

j =3 an immunoglobulin receptor, Immunoglobulin superfamily 

%J Receptor Translocation Associated, IRTA, protein into a 

suitable host cell; and (b) culturing the resulting cell 
: ^ so as to produce the polypeptide. 

15 

y = This invention provides an isolated nucleic acid molecule 

comprising at least 15 contiguous nucleotides capable of 

W specifically hybridizing with a unique sequence included 

within the sequence of the isolated nucleic acid molecule 

2 0 encoding IRTA protein. In an embodiment, the IRTA protein 

may be IRTA1, IRTA2 , IRTA3 , IRTA4 or IRTA 5 protein, or 
fragment (s) thereof, having the amino acid sequence set 
forth in any of Figures 18A, 18B-1-18B-3, 18C-1-18C-2, 
18D-1-18D-2 or 18E-1-18E-2, respectively. 

25 

This invention provides a method for detecting a B cell 
malignancy or a type of B cell malignancy in a sample from 
a subject wherein the B cell malignancy comprises a lq21 
chromosomal rearrangement which comprises: a) obtaining 

3 0 RNA from the sample from the subject; b) contacting the 

RNA of step (a) with a nucleic acid molecule of at least 
15 contiguous nucleotides capable of specifically 
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hybridizing with a unique sequence included within the 
sequence of an isolated RNA encoding human IRTA protein 
selected from the group consisting of human IRTA1 , IRTA2 , 
IRTA3, IRTA4 and IRTA5, under conditions permitting 
5 hybridization of the RNA of step (a) with the nucleic acid 

molecule capable of specifically hybridizing with a unique 
sequence included within the sequence of an isolated RNA 
encoding human IRTA protein, wherein the nucleic acid 
molecule is labeled with a detectable marker; and c) 
10 detecting any hybridization in step (b) , wherein detecion 

of hybridization indicates presence of B cell malignancy 
or a type of B cell malignancy in the sample. 

This invention provides an antisense oligonucleotide 
IS having a sequence capable of specifically hybridizing to 

an mRNA molecule encoding a human IRTA protein so as to 
prevent overexpression of the mRNA molecule. 

This invention provides a purified IRTA1 protein 
2 0 comprising the amino acid sequence set forth in Figure 18A 

(SEQ ID N0:1) . 



This invention provides 
comprising the amino acid 
25 18B-1-18B-3 (SEQ ID NO : 3 ) . 



a purified IRTA2 protein 
sequence set forth in Figures 



This invention provides a purified IRTA3 protein 
comprising the amino acid sequence set forth in Figures 
18C-1-18C-2 (SEQ ID NO : 5 ) . 



30 



This invention provides a purified IRTA4 protein 
comprising the amino acid sequence set forth in Figures 
18D-1-18D-2 (SEQ ID NO: 7) . 

This invention provides a purified IRTA5 protein 
comprising the amino acid sequence set forth in Figures 
18E-1-18E-2 (SEQ ID NO: 9) . 

This invention provides an antibody/antibodies directed to 
an epitope of a purified IRTA1 , IRTA2 , IRTA3 , IRTA4 or 
IRTA5 protein, or fragment (s) thereof, having the amino 
acid sequence set forth in any of Figures 18A, 18B-1-18B- 
3, 18C-1-18C-2, 18D-1-18D-2 or 18E-1-18E-2. 

This invention provides an antibody directed to a purified 
IRTA protein selected from the group consisting of IRTA1, 
IRTA2 , IRTA3 , IRTA4 and IRTA5 . 

This invention provides a pharmaceutical composition 
comprising an amount of the antibody directed to an IRTA 
protein effective to bind to cancer cells expressing an 
IRTA protein selected from the group consisting of human 
IRTA1 , IRTA2, IRTA3 , IRTA4 and IRTA 5 so as to prevent 
growth of the cancer cells and a pharmaceutically 
acceptable carrier. 

This invention provides a pharmaceutical composition 
comprising an amount of any of the oligonucleotides of 
nucleic acid molecules encoding IRTA proteins described 
herein effective to prevent overexpression of a human IRTA 
protein and a pharmaceutically acceptable carrier capable. 



This invention provides a method of diagnosing B cell 
malignancy which comprises a lq21 chromosomal 
rearrangement in a sample from a subject which comprises: 
a) obtaining the sample from the subject; b) contacting 
5 the sample of step (a) with an antibody directed to a 

purified IRTA protein capable of specifically binding with 
a human IRTA protein selected from the group consisting of 
human IRTA1 , IRTA2 , IRTA3 , IRTA4 and IRTA5 IRTA protein on 
a cell surface of a cancer cell under conditions 

10 permitting binding of the antibody with human IRTA protein 

: iS on the cell surface of the cancer cell, wherein the 

antibody is labeled with a detectable marker; and c) 
detecting any binding in step (b) , wherein detecion of 
binding indicates a diagnosis of B cell malignancy in the 

15 sample . 

This invention provides a method of detecting human IRTA 
protein in a sample which comprises : a) contacting the 
sample with any of any of the above-described ant i- IRTA 
20 antibodies under conditions permitting the formation of a 

complex between the antibody and the IRTA in the sample; 
and b) detecting the complex formed in step (a) , thereby 
detecting the presence of human IRTA in the sample. 

25 This invention provides a method of treating a subject 

having a B cell cancer which comprises administering to 
the subject an amount of anti-IRTA antibody effective to 
bind to cancer cells expressing an IRTA protein so as to 
prevent growth of the cancer cells and a pharmaceutically 

30 acceptable carrier, thereby treating the subject. 
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This invention provides a method of treating a subject 
having a B cell cancer which comprises administering to 
the subject an amount of an antisense oligonucleotide 
having a sequence capable of specifically hybridizing to 
5 an mRNA molecule encoding a human ITRA protein so as to 

prevent overexpression of the human IRTA protein, so as to 
arrest cell growth or induce cell death of cancer cells 
expressing IRTA protein (s) and a pharmaceutical^ 
acceptable carrier, thereby treating the subject. 

0 

3 The invention also provides a pharmaceutical composition 

j comprising either an effective amount of any of the 

= oligonucleotides described herein and a pharmaceut ically 

acceptable carrier. 



The invention also provides a pharmaceutical composition 
comprising either an effective amount of an antibody 
directed against an epitope of any IRTA protein described 
herein and a pharmaceutically acceptable carrier. 



BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A-1B. Molecular cloning of the translocation 
t (1;14) (q21;q32) in the FR4 multiple 
myeloma cell line. Fig. 1A) Schematic 
representation of the AFR4B-5 and AFR4S-a 
clones, representing der(14) and der(l) 
breakpoints, and of the germline IgH and 
lq21 loci. Fig. IB) Nucleotide sequence 
of the breakpoint junction and its 
alignment to the corresponding germline 
regions of chromosome 14. Sex, IgA switch 
region; LCR: 3' IgH locus control region; 
B, BamHI; H, Hindlll; X, Xhol . 

Figures 2A-2B. Genomic map of the lq21 locus in the 
vicinity of the FR4 breakpoint. Fig. 2A) 
Restriction endonuclease map and schematic 
representation of genomic clones, i.e. 
bacteriophages (1) , PI artificial 

chromosomes (PACs) (2) , and yeast 

artificial chromosome (YAC) (3) , spanning 
the germline lq21 locus at the FR4 
breakpoint region (arrowhead) . The name of 
each clone is placed directly on top of its 
representation. End fragments derived from 
the PAC and YAC inserts are depicted as 
circles, with either an SP6/T7 vector 
orientation (PAC) , or left/right arm vector 
orientation (YAC) . The top panel in Fig. 
1A depicts the genomic organization of two 
genes surrounding the FR4 breakpoint. The 



two genes were identified by exon trapping 
of PAC 49A16. They are closely spaced in 
the genome, within < 3 0 Kb of each other 
and are named MUM 2 and MUM 3 (multiple 
myeloma- 2 and 3) . In the scheme of their 
genomic loci, black boxes indicate coding 
exons, whereas white and light or medium 
grey boxes indicate non-coding exons. 
Connecting introns are lines. MUM 3 (left) 
gives rise to three alternatively spliced 
mRNAs, all sharing a common 5' untranslated 
region (UTR) but diverse 3 ' UTRs (marked by 
different shades) . Numbers underneath the 
boxes identify the order of exons in the 
cDNA. Exons less than 100 bp are depicted 
as thin vertical lines. The position and 
size of each exon was determined by 
sequencing of genomic PAC and phage clones 
and by hybridization of cDNA probes to 
endonuclease-digested clone DNA. PAC and 
YAC mapping was performed by partial 
digestion with rare cutting enzymes 
followed by Pulse-Field-Gel-Electrophoresis 
and hybridization to internal and end- 
derived probes. Dashed lines align 
regions of overlap. S, Sad; H, Hindi II; 
S, Swal; Pc, Pad; P, Pmel ; Fig. 2B) 
Genethon genetic linkage map of lq21 in the 
region of the MUM2/MUM3 locus. Sequence- 
tagged sites (STS) are ordered in 
approximate distance previously determined 
by Dib, C, et al . (1996) Nature . 380:162- 
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164. STS WI-5435 (in bold) is contained 
within YAC 23GC4 and PAC 49A16. Parallel 
vertical lines represent interrupted 
segments, whose approximate size is 
5 depicted above in megabases (MB) . Sizing 

was estimated by the size of nonchimeric 
YAC contigs between two markers . The BCL9 
gene at the centromere was cloned from a 
different t ( 1 ; 14 ) (q2 1 ; q32 ) breakpoint by 
10 Willis T.G. et al . , (1998) Blood 91, 

6:1873-1881. The FcGRIIA gene is at the 
Iq21-q2 2 chromosomal band border. 



Figures 3A-3C. MUM2 mRNA structure and expression pattern. 

Fig. 3A) Schematic representation of MUM2 
mRNA. Pattern-filled, wide boxes represent 
coding domains and narrow empty boxes 
represent untranslated regions. SP, signal 
peptide; EC, extracellular domain; TM, 
transmembrane domain; CYT, cytoplasmic 
domain; A(n), polyA tail. The 
extracellular region is composed of four 
immunoglobulin- like domains as depicted. 
Alternative polyadenylat ion signals 

(arrows) generate three MUM 2 mRNA species 
(a, b, c) whose length (in Kb) ranges from 
2.6-3.5. Fig. 3B) Northern blot analysis 
of MUM 2 mRNA expresion in human tissues of 
the immune system. The cDNA probe used for 
the analysis is shown as a solid bar 
underneath the mRNA scheme in Fig. 3A) . 
Each lane contains 2 fig mRNA of the 



corresponding tissue. On the right side of 
the blot, the position of RNA molecular 
weight markers is depicted. The position 
of MUM2 and GAPDH mRNA transcripts is shown 
by arrows. (A GAPDH probe was included in 
the hybridization as an internal control - 
0.15 ng labelled +50 ng unlabelled probe-). 
The rsults of this analysis show weak 
expression of MUM2 in lymph node and 
spleen. MUM2 expression was not detected 
in a variety of other human tissues (data 
not shown) . Fig. 3C) Northern blot 

analysis of MUM2 expression in total RNA 
from EREB, a conditional EBV- transformed B 
lymphoblastoid cell line. EREB carries the 
EBV genome with an EBNA2 -estrogen receptor 
fusion protein, active only in the presence 
of estrogen. For this experiment, cells 
were grown in the presence of estrogen 
(l/xg/ml) , followed by estrogen withdrawal 
for the indicated times. Upon estrogen 
withdrawal, EREB cells undergo G0/G1 
arrest, determined by the loss of c-myc 
expression. In Fig. 3C, a Northern blot of 
EREB total RNA (10/xg per lane) was 
hybridized with the MUM2 cDNA probe shown 
in Fig. 3A and the GAPDH internal control 
probe, as in Fig. 3B. Arrows indicate the 
position of the corresponding mRNAs on the 
EREB blot. a, band c correspond to the 
MUM 2 species in panel Fig 3A. The same 
blot was then stripped and reprobed with a 
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c-myc cDNA probe (exon 2) to verify 
cellular G0/G1 arrest. Quantitation of 
MUM2 mRNA by the use of a phosphorimager 
densitometric analysis demonstrates a 10- 
5 fold increase in their levels within 48 hrs 

of estrogen withdrawal, suggesting that 
MUM2 expression is elevated as the cells 
enter a resting phase. 



Figures 4A-4B. MUM 3 mRNA structure and expression pattern. 

Fig. 4A) Schematic representation of MUM 3 
mRNA Pattern-filled, wide boxes represent 
coding domains and narrow empty or gray 
boxes represent untranslated regions. SP, 
signal peptide; EC, extracellular domain; 
TM, transmembrane domain; CYT, cytoplasmic 
domain; A(n), polyA tail. The 

extracellular region is composed of 
immunoglobulin- like domains, as depicted. 
Alternative splicing generates four mRNA 
species with diverse subcellular 

localization. MUM3-a and -d proteins are 
secreted, whereas MUM3-b contains a 
hydrophobic stretch of amino acids at its 
C- terminus which may serve as a signal for 
addition of a glycophosphatidyl- inositol 
anchor (GPI-anchor) , as shown. MUM3-C 
spans the plasma membrane. Sequence 
identity among species is indicated by 
identical filling. Fig. 4B) Northern blot 
analysis of MUM 3 mRNA expression in 
multiple human tissues (left) and in 
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various lymphoid and non- lymphoid cell 
lines (right) . The cDNA probe used is 
shown as a solid bar below the cDNA scheme 
in Fig. 4A. Each lane contains mRNA of 

the corresponding tissue or cell line. The 
position of MUM3 and GAPDH mRNA transcripts 
is shown by arrows. (A GAPDH probe was 
included in the hybridization as an 
internal control as described in Fig. 3) 
a, b, c and d correspond to the MUM 3 mRNA 
species shown in Fig. 4A. RD, NC42 and 
CB33, Epstein-Barr virus transformed B 
lymphoblastoid cell lines; EREB , 

conditional E B V - t r an s f o r m e d B 

lymphoblastoid cell line; FR4 , plasma cell 
line; M0LT4 and HUT7 8 , T cell lines; HL60 
and U937, myelomonocyt ic cell lines; K562, 
erythroid cell line. The results suggest 
that MUM3 is expressed solely in the immune 
system tissues of bone marrow, lymph and 
spleen and in particular in B cells with a 
lymphoblastoid phenotype . 

Figure 5. Nucleotide and amino acid sequence of human 

MUM 2 . The deduced amino acid sequence is 
shown above the nucleotide sequence in one- 
letter code and is numbered on the right, 
with position 1 set to the first codon of 
the signal peptide. The predicted signal 
peptidase site was derived by a computer 
algorithm described in Nielsen et al. , 
Protein Engineering 10, 1-6 (1997) and is 



marked by an arrowhead. The 
polyadenylation signal AATAAA is 

underlined. Potential sites for N- 

glycosylation are also underlined in the 
amino acid sequence. A hydrophobic stretch 
of 16 amino acids predicted to span the 
plasma membrane is doubly underlined. 
Consensus SH2 -binding sites are highlighted 
by a wavy underline. 

Nucleotide and amino acid sequence of human 
MUM3-a. The deduced amino acid sequence is 
shown above the nucleotide sequence in one- 
letter code and is numbered on the right, 
with position 1 set to the first codon of 
the signal peptide. The predicted site for 
signal peptidase cleavage was derived as 
previously stated above and is marked by an 
arrowhead. The polyadenylation signal 

ATTAAA is underlined. Potential sites for 
N-glycosylation are also underlined in the 
amino acid sequence. The protein lacks a 
transmembrane domain and is predicted to be 
secreted . 

Nucleotide and amino acid sequence of human 
MUM3-b. The deduced amino acid sequence is 
shown above the nucleotide sequence in one- 
letter code and is numbered on the right, 
with position 1 set to the first codon of 
the signal peptide. The predicted site for 
signal peptidase cleavage was derived as 



previously stated above and is marked by an 
arrowhead. The polyadenylat ion signal 

AATAAA is underlined. Potential sites for 
N-glycosylation are underlined in the amino 
acid sequence. 

Figure 6C-1-6C-2. Nucleotide and amino acid sequence of 
human MUM3-C. The deduced amino acid 
sequence is shown above the nucleotide 
sequence in one- letter code and is 
numbered on the right, with position 1 
set to the first codon of the signal 
peptide. The predicted site for 

signal peptidase cleavage was derived 
as previously stated above and is 
marked by an arrowhead. The 
polyadenylation signal AATAAA is 
underlined. Potential sites for N- 
glycosylation are underlined in the 
amino acid sequence . A hydrophobic 
stretch of 23 amino acids predicted to 
span the plasma membrane is doubly 
underlined. Consensus SH2 -binding 

sites are highlighted by a wavy 
underline . 

Figures 7A-7C. t (1 ; 14) (q21 ; 32 ) in FR4 generates a MUM2/Ca 
fusion transcript. Fig. 7A) Schematic 
representation of ther der(14) genomic 
clone XFR4B-5 and of the germline IgHAl 
locus. The FR4 breakpoint is marked by an 
arrow. Filled and open boxes represent the 



MUM2 and Calpha coding and non- coding exons 
respectively. The position of the MUM 2 
exon 1 probe used for Northern blot 
analysis is shown by a bar. Fig. 7B) 
Northern blot analysis with a MUM2 exon 1 
probe on FR4 and additional cell lines 
detects an abnormal message of 0.8 Kb, 
selectively in FR4 . Arrowheads point to 
the location of normal MUM 2 message in EREB 
mRNA. JJN3 and U266, myeloma cell lines; 
EREB, conditional EBV- transformed B 

lymphoblastoid cell line. Two /ig of polyA+ 
RNA were loaded per lane. Fig. 7C) 

Nucleotide ans amino acid sequence of the 
MUM2-Ca fusion cDNA in FR4 . The cDNA was 
amplified by RT-PCR from FR4 total RNA 
using the primers shown in Fig. 7A, and was 
subsequently subcloned and sequenced. The 
deduced amino acid sequence is shown above 
the nucleotide sequence in one- letter code 
and is numbered on the right with position 
1 set to the first codon of the signal 
peptide. The predicted site for signal 
peptidase cleavage was derived as 
previously stated above and is marked by an 
arrowhead. The polyadenylation signal 

AATAAA is underlined. The Calpha 

transmembrane domain is underlined. The 
MUM2 portion of the cDNA is shown on 
italics. H, Hindi I I; B, BamHI ; X, Xhol; 
Sa, IgA switch region; EC, extracellular 



region; TM, transmembrane; CYT, cytoplasmic 
domain . 

Figures 8A-8C. Molecular cloning of the translocation 
t (1,-14) (q21;q32) in the FR4 multiple 
myeloma cell line. Fig. 8A) Schematic 
representation of the phage clones 
representing der(14) and der(l) breakpoints 
and the germline IGH and lq21 loci. 
Chromosome 14 sequences are indicated by a 
solid black line with black boxes 
representing Cal exons . Chromosome 1 
sequences are shown as a grey line. The 
probes used for chromosomal mapping are 
indicated below the map. Restriction 
enzyme codes are: B, BamHI ; H, Hindi 1 1; X, 
Xhol; S, Sac I; E , EcoRI . For enzymes 

marked by (*) only sites delineating the 
probes are shown. Sa: IgA switch region ,- 
LCR: 3 ' IgH locus control region. Fig. 8B) 
Nucleotide sequence of the breakpoint 
junctions and their alignment to the 
corresponding germline regions of 

chromosomes 14 and 1. Fig. 8C) Left, 
fluorescence in situ hybridization (FISH) 
analysis on human normal metaphase spreads 
with the PAC clone 49A16 (Fig. 13) spanning 
the germlinelq21 region at the FR4 
breakpoint. Right, DAPI stained image from 
the same metaphase spread. 



-21- 

Figures 9A-9B. Structure of IRTA1 and IRTA2 cDNAs . Figs. 

9A,9B) Schematic representation of the 
full-length IRTA1 (Fig. 9A) and IRTA2 (Fig. 
9B) cDNAs . Pattern-filled, wide boxes 

represent coding domains and narrow boxes 
represent untranslated regions (UTR) . The 
predicted site for signal peptidase 
cleavage is marked by an arrowhead and was 
derived according to the SignallP World 
Wide Web server at 

http : / /www . cbfl ■ dtu . dk/servi ces /Signal IP . 
The transmembrane domain prediction 
algorithm is described in Tusnady et al, 
1998. SP, signal peptide; EC, 

extracellular domain; Ig, immuno-globulin- 
type; TM, transmembrane domain; CYT, 
cytoplasmic domain; A(n), polyA tail; GPI, 
glycophosphatidyl inositol. In (Fig. 9A) , 
arrows in the 3 ' UTR indicate different 
polyadenylation addition sites utilized in 
the IRTA1 cDNA. In (Fig. 9B) , different 
3' UTR regions in IRTA2 isoforms are 
differentially shaded. Bars underneath the 
UTR regions in (Fig. 9A) and (Fig. 9B) 
identify probes used for Northern blot 
analysis in Figure 12 . 
Figures 10A-10B. Comparison of the amino acid sequences 
of IRTA1 and IRTA2 with members of the 
Fc Receptor family Fig. 10A) Multiple 
sequence alignment of the first two 
(top) and the third (bottom) 
extracellular Ig-domains of IRTA1 and 



IRTA2 to Fc receptor family members. 
The sequences were compared using the 
ClustalW program (Thompson et al . , 
1994) . Black-shaded boxes indicate 
conserved aminoacids among all 
sequences; dark-grey shaded boxes 
indicate conserved aminoacids among at 
least half of the sequences; light - 
shaded boxes indicate conservative 
substitutions. Fig. 10B) Alignment of 
the SH2 -binding domains of IRTA1 and 
IRTA2 with the IT AM and ITIM consensus 
motifs. Conserved aminoacid positions 
are in bold. Symbol X represents any 
aminoacid . 



Figures 11A-11B-4. IRTA1 expression pattern. Fig. 11A) 
Left panel. Northern blot analysis of 
IRTA1 mRNA expression in tissues of 
the human immune system. Each lane 
contains 2mg mRNA. The position of 
RNA molecular weight markers is 
depicted on the right side of the 
blot. The positions of the IRTA1 and 
GAPDH mRNA transcripts are shown by 
arrows . (A GAPDH probe was included 
in the hybridization as an internal 
control-0.15 ng labelled + 50 ng 
unlabelled probe-). Right Panel. 

Northern blot analysis of IRTA1 
expression in total RNA from the ER/EB 
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cell line (10 mg per lane) . For this 
experiment, cells were grown in the 
presence of estrogen (lmg/ml) , 
followed by estrogen withdrawal for 
the indicated times. Arrows indicate 
the positions of the corresponding 
mRNAs. a, b and c correspond to the 
IRTA1 differentially polyadenylated 
species. The same blot was stripped 
and reprobed with a MYC cDNA probe 
(exon 2) to verify cellular G 0 /G 1 
arrest. Densitometric analysis of 

IRTA1 mRNA levels is plotted in the 
adjacent column graph. The cDNA probe 
used is shown as a solid bar 
underneath the IRTA1 mRNA scheme in 
Figure 9A. Fig. HB-l-llB-4) In situ 
hybridization analysis of IRTA1 
expression in serial sections of human 
tonsil. 1. Sense IRTA1 probe 2. 

Antisense IRTA1 probe 3 . H&E staining 
4. Antisense IRTA1 signal superimposed 
over an H&E stained section. GC, 
germinal center, MargZ, marginal zone 

Figure 12A-12B-4. IRTA2 expression pattern. Fig. 12A) 
Northern blot analysis of IRTA2 mRNA 
expression in multiple human tissues 
(left panel) and in various lymphoid 
and non-lymphoid cell lines (right 
panel) . Each lane contains 2mg mRNA. 



The positions of the IRTA2 and GAPDH 
transcripts are shown by arrows. a, 
b, c and d correspond to the 
alternatively spliced IRTA2 mRNA 
isoforms. RD, NC42 and CB33, Epstein- 
Barr virus transformed B 

lymphoblastoid cell lines; EREB , 
conditional EBV- transformed B 

lymphoblastoid cell line; FR4 , plasma 
cell line; M0LT4 and HUT7 8 , T cell 
lines; HL60 and U937, myelomonocyt ic 
cell lines; K562, erythroid cell line. 
The cDNA probe used is shown as a 
solid bar underneath the IRTA2 mRNA 
scheme in Figure 9B . Figs. 12B-1-12B- 
4) In situ hybridization analysis of 
IRTA2 mRNA expression in human tonsil. 
Fig. 12B-1. Sense IRTA2 cDNA probe, 
Fig. 12B-2. Antisense IRTA2 cDNA 
probe, Fig. 12B-3 . H&E staining, Fig. 
12B-4. Antisense IRTA2 cDNA probe 
signal superimposed over H&E stained 
section. GC, germinal center, MargZ, 
marginal zone 

Map of the germline lq21 region spanning 
the FR4 breakpoint and genomic organization 
of IRTA1 and IRTA2 . Primers used to 
amplify IRTA1 exons from spleen cDNA are 
marked by arrowheads on top panel. Black 
and light boxes indicate coding and non- 
coding exons respectively. Arrows indicate 
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position of BCL9, MUC1, IRTA family and 
FCQRIIB loci. S, Sad; H, Hindlll; S, 
Swal; Pc, Pad; P, Pmel; Mb, Megabases 



Figures 14A-14D. t (1 ; 14) (q21;q32) in FR4 generates an 

IRTAl/Ca fusion transcript. Fig. 14A) 
Schematic representation of the 
der(14) genomic clone 1FR4B-5 and of 
the germline IgCa x locus. The FR4 
breakpoint is marked by an arrow. 
Filled and open boxes represent the 
IRTA1 and Ca 2 coding and non- coding 
exons respectively. Fig. 14B) 

Northern blot analysis with an IRTA1 
exon 1 probe (shown by a bar in Fig. 
14A) on FR4 and additional cell lines 
detects an abnormal message in FR4 . 
Arrowheads point to the location of 
normal IRTA1 message in ER/EB mRNA . 
JJN3 and U266, myeloma cell lines. 
Two mg of polyA+ RNA loaded per lane . 
Fig. 14C) Schematic representation of 
the IRTAl/Ca fusion cDNA in FR4 . The 
cDNA was amplified by RT-PCR from FR4 
total RNA using the primers shown in 
(Fig. 14A) , and sequenced after 
subcloning. Fig. 14D) SDS/PAGE 

analysis of immunoprec ipi tat es 

obtained from vector control 

transfected and IRTAl/Ca transient 
expression construct transfected 293 -T 



10 Figures 15A-15B 
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cells (lanes 1 & 2), or the following 
cell lines: mlgA positive 

lymphoblastoid cell line-Dakiki (lane 
3) , FR4 (lane 4) , mlgM positive NHL 
cell line-Ramos (lane 5). H, Hindi II; 
B, BamHI; X,XhoI; Sa, IgA switch 

region; EC, extracellular region; TM, 
transmembrane; CYT, cytoplasmic 

IRTA2 expression is deregulated in 
cell lines carrying lq21 

abnormalities. Figs. 15A, 15B) 

Northern blot analysis of IRTA2 mRNA 
expression in Burkitt lymphoma (Fig. 
15A) and Multiple Myeloma (Fig. 15B) 
cell lines. The cDNA probe used is 
the same as in Fig. 12. Each lane 
contains 2mg mRNA. The positions of 
the IRTA2 and GAPDH mRNA transcripts 
are shown by dashes and arrows, 
respectively. The relative levels of 
IRTA2 mRNA expression in the left 
panel (Fig. 15A) were plotted on the 
right panel (Fig. 15A) after 
densitometric analysis and 

normalization versus the GAPDH levels. 
The right panel of (Fig. 15B) is a 
summary of the Northern blot analysis 
results . 



Figures 16-1-16-4 IRTA1 expression in normal lymphoid 
tissue. Paraffin-embedded sections 
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from normal human tonsil were stained 
with the following antibodies: Fig. 
16-1) Negative control; Fig. 16-2) 
anti-CD3 mouse monoclonal to detect T 
5 cells; Fig. 16-3) anti-IRTAl (mIRTA) 

mouse monoclonal; Fig. 16-4) anti- 
IRTAl (J92884K) rabbit polyclonal. 
IRTA1 positive cells are located in 
the perifollicular and intraepithelial 
10 region of the tonsil, the equivalent 

of the marginal zone in the spleen. 



Figure 17 IRTA1 expression in a stomach Mucosa- 

Associated-Lymphoid Tissue (MALT) B cell 
lymphoma. A paraffin- embedded section from 
a stomach MALT B cell lymphoma was stained 
with the anti-IRTAl (mIRTA) mouse 
monoclonal antibody and counterstained with 
H&E. The majority of MALT lymphomas 

analyzed were IRTA1 positive. This 
antibody therefore can be an effective tool 
in the differential diagnosis of MALT 
lymphoma. The mIRTAl antibody may also be 
proven useful in the therapy of this B cell 
tumor, similarly to the use of the anti- 
CD20 antibody (Rituximab) in the therapy of 
relapsed CD2 0 -positive lymphomas (Foon K. , 
Cancer J. 6: p273). 



30 



Figure 18A. 



IRTA1 cDNA and the amino acid sequence of 
the encoded IRTA1 protein. 



Figures 18B-1-18B-3 



Figures 18C-1-18C-2 



Figures 18D-1-18D-2 
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IRTA2 cDNA and the amino acid 
sequence of the encoded IRTA2 
protein . 

IRTA3 cDNA and the amino acid 
sequence of the encoded IRTA3 
protein . 

IRTA4 cDNA and the amino acid 
sequence of the encoded IRTA4 
protein . 



Figures 18E-1-18E-2 . 



IRTA5 cDNA and the amino acid 
sequence of the encoded IRTA5 
protein . 
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DETAILED DESCRIPTION OF THE INVENTION 

The following standard abbreviations are used throughout 
the specification to indicate specific nucleotides: 
C=cytosine; A=adenosine ; T=thymidine and G=guanosine. 

This invention provides an isolated nucleic acid molecule 
which encodes immunoglobulin receptor, Immunoglobulin 
superf amily Receptor Translocation Associated, IRTA, 
protein . 

As used herein "Immunoglobulin Receptor Translocation 
Associated" genes, "IRTA" are nucleic acid molecules which 
encode novel immunoglobulin superf amily cell surface 
receptors in B cells which are important in B cell 
development, and whose abnormal expression, e.g. 
deregulated expression, perturbs cell surface B cell 
immunological responses and thus is involved in B cell 
malignancy, including lymphomagenesis . 

Nucleic acid molecules encoding proteins designate "MUM-2" 
and "MUM-3" proteins in the First Series of Experiments are 
now called "IRTA-1" and "IRTA-2" genes, i.e. nucleic acid 
molecules which encode IRTA-1 and IRTA-2 proteins 
respectively. IRTA- 3, -4 and -5 proteins are members of 
the same the immunoglobulin gene superfamily as are the 
IRTA-1 and IRTA-2 proteins. 

In an embodiment of the above-described isolated nucleic 
acid molecule which encodes immunoglobulin receptor, 
Immunoglobulin superfamily Receptor Translocation 

Associated, IRTA, protein, the encoded IRTA protein is 



-30- 

IRTA1 protein comprising the amino acid sequence set forth 
in Figure 18A (SEQ ID NO:l) . 



In another embodiment of the above-described isolated 
5 nucleic acid molecule, the encoded IRTA protein is IRTA2 

protein comprising the amino acid sequence set forth in 
Figures 18B-1-18B-3 (SEQ ID NO : 3 ) . 

In a further embodiment of the above-described isolated 
10 nucleic acid molecule, the encoded IRTA protein is IRTA 3 

protein comprising the amino acid sequence set forth in 
Figures 18C-1-18C-2 (SEQ ID NO:5) . 

In yet another embodiment of the above-described isolated 
3i5 nucleic acid molecule, the encoded IRTA protein is IRTA4 

protein comprising the amino acid sequence set forth in 
Figures 18D-1-18D-2 (SEQ ID NO: 7) . 

In a still further embodiment of the above-described 

2 0 isolated nucleic acid molecule, the encoded IRTA protein 

is IRTA5 protein comprising the amino acid sequence set 
forth in Figures 18E-1-18E-2 (SEQ ID NO: 9) . 

In another embodiment of any of the above-described 
25 isolated nucleic acid molecules, the nucleic acid molecule 

is DNA. In further embodiments, the DNA is cDNA. In 
additional embodiments, the DNA is genomic DNA. In 
another embodiment, the nucleic acid molecule is an RNA 
molecule. In yet another embodiment, the DNA molecule is 

3 0 cDNA having the nucleotide sequence set forth in Figure 

18A (SEQ ID NO: 2) . In another embodiment, the DNA molecule 
is cDNA having the nucleotide sequence set forth in Figure 



18A (SEQ ID NO: 4) . In a further embodiment, the DNA 
molecule is cDNA having the nucleotide sequence set forth 
in Figure 18A (SEQ ID NO: 6). In another embodiment, the 
DNA molecule is cDNA having the nucleotide sequence set 
forth in Figure 18A (SEQ ID NO: 8) . In an embodiment, the 
DNA molecule is cDNA having the nucleotide sequence set 
forth in Figure 18A (SEQ ID NO: 10) . In preferred 
embodiments of the isolated nucleic acid molecule, wherein 
the nucleic acid molecules encode human IRTA1 , IRTA2 , 
IRTA3, IRTA4 or IRTA5 protein. In additional embodiments, 
the nucleic acid molecules encode mammalian IRTA1 protein. 
The mammalian IRTA1 protein may be murine IRTA1 protein. 
In another preferred embodiment, the isolated nucleic acid 
molecules are operatively linked to a promoter of DNA 
transcription. In yet another preferred embodiment of the 
isolated nucleic acid molecule, the promoter comprises a 
bacterial, yeast, insect, plant or mammalian promoter. 

This invention provides a vector comprising any of the 
above -described isolated nucleic acid molecule encoding 
IRTA proteins, including but not limited to mammalian IRTA 
proteins, of which human and murine are preferred. 
In an embodiment, the vector is a plasmid. 

This invention provides a host cell comprising the above- 
described vector comprising any of the above-described 
isolated nucleic acid molecule encoding IRTA proteins. 
Preferably, the isolated nucleic acid molecules in such 
vectors are operatively linked to a promoter of DNA 
transcription. In another embodiment of the host cell, 
the cell is selected from a group consisting of a 



bacterial cell, a plant cell, and insect cell and a 
mammalian cell. 



This invention provides a method of producing an IRTA 
polypeptide (protein) which comprises: (a) introducing a 
vector comprising an isolated nucleic acid which encodes 
an immunoglobulin receptor, Immunoglobulin superfamily 
Receptor Translocation Associated, IRTA, protein into a 
suitable host cell; and (b) culturing the resulting cell 
so as to produce the polypeptide. In further embodiments, 
the IRTA protein produced by the above-described method 
may be recovered and in a still further embodiment, may be 
purified either wholly or partially. In an embodiment 
the IRTA protein may be any of IRTA1, IRTA2, IRTA3 , IRTA4 , 
or IRTA5 protein. In further embodiments, any of the IRTA 
proteins may be mammalian proteins. In still further 
embodiemnts, the mammalian proteins may be human or mouse 
IRTA proteins . 

IRTA genes (nucleic acid molecules encoding IRTA protiens 
IRTA1, IRTA2, IRTA3 , IRTA4 and IRTA5) are useful for the 
production of the IRTA proteins encoded thereby. ITRA 
proteins are useful for production of antibodies; such 
antibodies are used as reagents for differential diagnosis 
of lymphoma subtypes in hematopathology . Antibodies 
directed against IRTA proteins and which bind specifically 
to IRTA proteins also have therapeutic uses, i.e. to 
specifically target tumor cells, which may be used and 
administered similarly to "Rituximab" (an anti-CD20 
antibody) , which is an antibody approved by the FDA for 
therapy of relapsed CD20-positive lymphomas (Foon K., 
Cancer J. 6(5):273). Anti-IRTAl, anti-IRTA2, anti-IRTA3, 



anti-IRTA4 and anti-IRTA5 antibodies are also useful 
markers for isolation of specific subsets of B cells in 
researchstudies of normal and tumor B cell biology. 
Moreover, Ant i - IRTA1 , anti-IRTA2, anti-IRTA3, anti-IRTA4 
and anti-IRTA5 antibodies are useful research reagents to 
experimentally study the biology of signaling in normal 
and tumor B cells. 

Methods of introducing nucleic acid molecules into cells 
are well known to those of skill in the art. Such methods 
include, for example, the use of viral vectors and calcium 
phosphate co-precipitation. Accordingly, nucleic acid 
molecules encoding IRTA proteins IRTA1, IRTA2 , IRTA3 , 
IRTA4 and IRTA 5 may be introduced into cells for the 
production of these IRTA proteins. 

Numerous vectors for expressing the inventive proteins 
IRTA1, IRTA2, IRTA3 , IRTA4 , and IRTA5 , may be employed. 
Such vectors, including plasmid vectors, cosmid vectors, 
bacteriophage vectors and other viruses, are well known in 
the art. For example, one class of vectors utilizes DNA 
elements which are derived from animal viruses such as 
bovine papilloma virus, polyoma virus, adenovirus, 
vaccinia virus, baculovirus, retroviruses (RSV, MMTV or 
MoMLV) , Semliki Forest virus or SV40 virus. Additionally, 
cells which have stably integrated the DNA into their 
chromosomes may be selected by introducing one or more 
markers which allow for the selection of transfected host 
cells. The markers may provide, for example, prototrophy 
to an auxotrophic host, biocide resistance or resistance 
to heavy metals such as copper. The selectable marker 
gene can be either directly linked to the DNA sequences to 
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be expressed, or introduced into the same cell by 
cotransf ormation . 



Regulatory elements required for expression include 
5 promoter sequences to bind RNA polymerase and 

transcription initiation sequences for ribosome binding. 
Additional elements may also be needed for optimal 
synthesis of mRNA. These additional elements may include 
splice signals, as well as enhancers and termination 
10 signals. For example, a bacterial expression vector 

= includes a promoter such as the lac promoter and for 

: y transcription initiation the Shine -Dalgarno sequence and 

; =: the start codon AUG. Similarly, a eukaryotic expression 

HJ vector includes a heterologous or homologous promoter for 

15 RNA polymerase II, a downstream polyadenylation signal, 

L the start codon AUG, and a termination codon for 

detachment of the ribosome. Such vectors may be obtained 
ul commercially or assembled from the sequences described by 

methods well known in the art, for example the methods 
20 described above for constructing vectors in general. 

These vectors may be introduced into a suitable host cell 
to form a host vector system for producing the inventive 
proteins. Methods of making host vector systems are well 
25 known to those skilled in the art. 

Suitable host cells include, but are not limited to, 
bacterial cells (including gram positive cells) , yeast 
cells, fungal cells, insect cells and animal cells. 
30 Suitable animal cells include, but are not limited to HeLa 

cells, Cos cells, CV1 cells and various primary mammalian 
cells. Numerous mammalian cells may be used as hosts, 



including, but not limited to, the mouse fibroblast cell 
NIH-3T3 cells, CHO cells, HeLa cells, Ltk" cells and COS 
cells. Mammalian cells may be transfected by methods well 
known in the art such as calcium phosphate precipitation, 
electroporation and microinjection. 

This invention provides an isolated nucleic acid molecule 
comprising at least 15 contiguous nucleotides capable of 
specifically hybridizing with a unique sequence included 
within the sequence of the isolated nucleic acid molecule 
encoding IRTA protein. In an embodiment, the IRTA protein 
may be IRTA1, IRTA2 , IRTA3 , IRTA4 or IRTA5 protein, or 
fragment (s) thereof, having the amino acid sequence set 
forth in any of Figures 18A, 18B-1-18B-3, 18C-1-18C-2, 
18D-1-18D-2 or 18E- 1-18E-2 , respectively. In other 

embodiments, the isolated nucleic acid molecules are 
labeled with a detectable marker. In still other 
embodiments of the isolated nucleic acid molecules, the 
detectable marker is selected from the group consisting of 
a radioactive isotope, enzyme, dye, biotin, a fluorescent 
label or a chemiluminescent label. 

This invention provides a method for detecting a B cell 
malignancy or a type of B cell malignancy in a sample from 
a subject wherein the B cell malignancy comprises a lq21 
chromosomal rearrangement which comprises: a) obtaining 
RNA from the sample from the subject; b) contacting the 
RNA of step (a) with a nucleic acid molecule of at least 
15 contiguous nucleotides capable of specifically 
hybridizing with a unique sequence included within the 
sequence of an isolated RNA encoding human IRTA protein 
selected from the group consisting of human IRTA1, IRTA2 , 



IRTA3 , IRTA4 and IRTA5 , under conditions permitting 
hybridization of the RNA of step (a) with the nucleic acid 
molecule capable of specifically hybridizing with a unique 
sequence included within the sequence of an isolated RNA 
encoding human IRTA protein, wherein the nucleic acid 
molecule is labeled with a detectable marker; and c) 
detecting any hybridization in step (b) , wherein detecion 
of hybridization indicates presence of B cell malignancy 
or a type of B cell malignancy in the sample. 

Detection of hybridization of RNA encoding IRTA proteins 
will indicate that a malignancy is a B cell malignancy. 
More specifically, detection of hybridization of RNA 
encoding ITRA1 protein indicates that the B cell 
malignancy is a Mucosa-Associated-Lymphoid Tissue (MALT) B 
cell lymphoma. Detection of hybridization of RNA encoding 
ITRA4 and IRTA5 proteins indicate that the B cell 
malignancy is a mantle cell lymphoma. In an embodiment of 
the above-described method, the B cell malignancy 
comprises a lq21 chromosomal rearrangement. One of skill 
will use the above -described method as a diagnostic aid in 
conjunction with other standard methods of 
detecting/diagnosing malignancies, e.g. pathology of a 
tumor sample, which may indicate lymphoma and the above - 
described method will then narrow the malignancy to a B 
cell lymphoma or more specifically to MALT) B cell 
lymphoma or a mantle cell lymphoma as discussed supra. 

One of skill is familiar with known methods of detecting 
of hybridization nucleic acid molecules to nucleic acid 
oligonucleotides, i.e. nucleic acid probes encoding a 
protein of interest for diagnostic methods. The nucleic 



acid molecules encoding the IRTA proteins of the subject 
invention are useful for detecting B cell malignancy. One 
of skill will recognize that variations of the above- 
described method for detecting a B cell malignancy in a 
sample include, but are not limited to, digesting nucleic 
acid from the sample with restrictio enzymes and 
separating the nucleic acid molecule fragments so 
oobtained by size fractionation before hybridization. 

In an embodiemnt of the above-described method for 
detecting a B cell malignancy in a sample from a subject, 
wherein the detectable marker is radioactive isotope, 
enzyme, dye, biotin, a fluorescent label or a 
chemiluminescent label. In a preferred embodiment, the B 
cell malignancy is selected from the group consisting of B 
cell lymphoma, multiple myeloma, Burkitt ' s lymphoma, 
marginal zone lymphoma, diffuse large cell lymphoma and 
follicular lymphoma cells. In a further embodiemnt, the B 
cell lymphoma is Mucosa-Associated-Lymphoid Tissue B cell 
lymphoma (MALT) . In another preferred embodiment, the B 
cell lymphoma is non-Hodgkin ' s lymphoma. 

This invention provides an antisense oligonucleotide 
having a sequence capable of specifically hybridizing to 
an mRNA molecule encoding a human ITRA protein so as to 
prevent overexpression of the mRNA molecule. 

In preferred embodiments of the antisense oligonucleotide, 
the ITRA protein selected from the group consisting of 
human IRTA1, IRTA2 , IRTA3 , IRTA4 and IRTA5 protein. In 
further embodiments of any of the above-described 
oligonucleotides of nucleic acid molecules encoding the 



IRTA1, IRTA2 , IRTA3 , IRTA4 and/or IRTA5 proteins, the 
nucleic acid may be genomic DNA or cDNA. 



One of skill is familiar with conventional techniques for 
nucleic acid hybridization of oligonucleotides, e.g. 
Ausubel, F.M. et al . Current Protocols in Molecular 
Biology, (John Wiley & Sons, New York, 19 98) , for example 
stringent conditions of 65°C in the presence of an elevated 
salt concentration. Such conditions are used for 
completely complementary nucleic acid hybridization, 
whereas conditions that are not stringent are used for 
hybridization of nucleic acids which are not totally 
complementary . 

As used herein, the phrase "specifically hybridizing" 
means the ability of a nucleic acid molecule to recognize 
a nucleic acid sequence complementary to its own and to 
form double-helical segments through hydrogen bonding 
between complementary base pairs. As used herein, a 
"unique sequence" is a sequence specific to only the 
nucleic acid molecules encoding the IRTA1, IRTA2 , IRTA3 , 
IRTA4 and IRTA5 proteins. Nucleic acid probe technology 
is well known to those skilled in the art who will readily 
appreciate that such probes may vary greatly in length and 
may be labeled with a detectable label, such as a 
radioisotope or fluorescent dye, to facilitate detection 
of the probe. Detection of nucleic acid molecules 
encoding the IRTA1, IRTA2 , IRTA3 , IRTA4 and/or IRTA5 
proteins is useful as a diagnostic test for any disease 
process in which levels of expression of the corresponding 
IRTA1 , IRTA2 , IRTA3 , IRTA4 and/or IRTA5 proteins is 
altered. DNA probe molecules are produced by insertion of 



a DNA molecule which encodes mammalian IRTA1 , IRTA2 , 
IRTA3 , IRTA4 and/or IRTA5 proteins or fragments thereof 
into suitable vectors, such as plasmids or bacteriophages, 
followed by insertion into suitable bacterial host cells 
and replication and harvesting of the DNA probes, all 
using methods well known in the art. For example, the DNA 
may be extracted from a cell lysate using phenol and 
ethanol, digested with restriction enzymes corresponding 
to the insertion sites of the DNA into the vector 
(discussed herein) , electrophoresed, and cut out of the 
resulting gel. The oligonucleotide probes are useful for 
'in situ' hybridization or in order to locate tissues 
which express this IRTA gene family, and for other 
hybridization assays for the presence of these genes 
{nucleic acid molecules encoding any of the IRTA1 - IRTA5 
protiens) or their mRNA in various biological tissues. In 
addition, synthesized oligonucleotides (produced by a DNA 
synthesizer) complementary to the sequence of a DNA 
molecule which encodes an IRTA1 , IRTA2 , IRTA3 , IRTA4 or 
IRTA5 protein are useful as probes for these genes, for 
their associated mRNA, or for the isolation of related 
genes by homology screening of genomic or cDNA libraries, 
or by the use of amplification techniques such as the 
Polymerase Chain Reaction. 

This invention provides a purified IRTA1 protein 
comprising the amino acid sequence set forth in Figure 18A 
(SEQ ID NO:l). In an embodiment of the purified IRTA1 
protein, wherein the IRTA1 protein is human IRTA1 . 



This invention provides a purified IRTA2 protein 
comprising the amino acid sequence set forth in Figures 



18B-1-18B-3 (SEQ ID NO : 3 ) . In an embodiment of the 
purified IRTA2 protein, the IRTA2 protein is human IRTA2 . 



This invention provides a purified IRTA3 protein 
comprising the amino acid sequence set forth in Figures 
18C-1-18C-2 (SEQ ID NO : 5 ) . In an embodiment of the 
purified IRTA3 protein, the IRTA3 protein is human IRTA3 . 

This invention provides a purified IRTA4 protein 

comprising the amino acid sequence set forth in Figures 

18D-1-18D-2 (SEQ ID NO: 7) . In an embodiment of the 
purified IRTA3 protein, wherein the IRTA4 protein is human 
IRTA4 . 

This invention provides a purified IRTA5 protein 

comprising the amino acid sequence set forth in Figures 

18E-1-18E-2 (SEQ ID NO: 9) . In an embodiment of the 

purified IRTA5 protein, the IRTA5 protein is human IRTA5 . 

In order to facilitate an understanding of the 
Experimental Details section which follows, certain 
frequently occurring methods and/or terms are best 
described in Sambrook, et al.(1989) and Harlow & Lane, 
Antibodies: A Laboratory Manual . Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY: 1988. 

This invention provides an antibody/antibodies directed to 
an epitope of a purified IRTA1, IRTA2 , IRTA3 , IRTA4 or 
IRTA5 protein, or fragment (s) thereof, having the amino 
acid sequence set forth in any of Figures 18A, 18B-1-18B- 
3, 18C-1-18C-2, 18D-1-18D-2 or 18E-1-18E-2. 



As used herein, the term "antibody" includes, but is not 
limited to, both naturally occurring and non-naturally 
occurring antibodies. Specifically, the terra "antibody" 
includes polyclonal and monoclonal antibodies, and binding 
fragments thereof. Furthermore, the term "antibody" 

includes chimeric antibodies and wholly synthetic 
antibodies, and fragments thereof. The polyclonal and 
monoclonal antibodies may be "purified" which means the 
polyclonal and monoclonal antibodies are free of any other 
antibodies. As used herein, partially purified antibody 
means an antibody composition which comprises antibodies 
which specifically bind to any of the IRTA protein (s) of 
the subejct invention, and consists of fewer protein 
impurities than does the serum from which the antibodies 
are derived. A protein impurity is a protein other than 
the antibodies specific for the IRTA protein (s) of the 
subejct invention. For example, the partially purified 
antibodies may be an IgG preparation. 

Polyclonal antibodies ( ant i- IRTA antibodies) may be 
produced by injecting a host animal such as rabbit, rat, 
goat, mouse or other animal with the immunogen(s) of this 
invention, e.g. a purified human IRTA1, IRTA2 , IRTA3 , 
IRTA 4 or IRTA5 , described infra. The sera are extracted 
from the host animal and are screened to obtain polyclonal 
antibodies which are specific to the immunogen . Methods 
of screening for polyclonal antibodies are well known to 
those of ordinary skill in the art such as those disclosed 
in Harlow & Lane, Antibodden: A L aboratory Manual, (Cold 
Spring Harbor Laboratories, Cold Spring Harbor, NY: 1988) 



the contents of which are hereby incorporated by 
reference . 

The anti-IRTA monoclonal antibodies of the subject 
invention may be produced by immunizing for example, mice 
with an immunogen (the IRTA polypeptides or fragments 
thereof as described herein) . The mice are inoculated 
intraperitoneally with an immunogenic amount of the above - 
described immunogen and then boosted with similar amounts 
of the immunogen. Spleens are collected from the 

immunized mice a few days after the final boost and a cell 
suspension is prepared from the spleens for use in the 
fusion . 

Hybridomas may be prepared from the splenocytes and a 
murine tumor partner using the general somatic cell 
hybridization technique of Kohler, B. and Milstein, C, 
Nature (1975) 256: 495-497. Available murine myeloma 
lines, such as those from the American Type Culture 
Collection (ATCC) , 10801 University Boulevard, Manassas, 
VA 20110-2209, USA, may be used in the hybridization. 
Basically, the technique involves fusing the tumor cells 
and splenocytes using a fusogen such as polyethylene 
glycol. After the fusion the cells are separated from the 
fusion medium and grown in a selective growth medium, such 
as HAT medium, to eliminate unhybridized parent cells. 
The hybridomas may be expanded, if desired, and 
supernatants may be assayed by conventional immunoassay 
procedures, for example radioimmunoassay, using the 
immunizing agent as antigen. Positive clones may be 
characterized further to determine whether they meet the 
criteria of the invention antibodies. 



Hybridomas that produce such antibodies may be grown in 
vitro or in vivo using known procedures. The monoclonal 
antibodies may be isolated from the culture media or body 
fluids, as the case may be, by conventional immunoglobulin 
purification procedures such as ammonium sulfate 
precipitation, gel electrophoresis, dialysis, 

chromatography, and ultrafiltration, if desired. 

In the practice of the subject invention any of the above- 
described antibodies may be labeled with a detectable 
marker. In one embodiment, the labeled antibody is a 
purified labeled antibody. The term "antibody" includes, 
by way of example, both naturally occurring and non- 
naturally occurring antibodies. Specifically, the term 
"antibody" includes polyclonal and monoclonal antibodies, 
and fragments thereof. Furthermore, the term "antibody" 
includes chimeric antibodies and wholly synthetic 
antibodies, and fragments thereof. A "detectable moiety" 
which functions as detectable labels are well known to 
those of ordinary skill in the art and include, but are 
not limited to, a fluorescent label, a radioactive atom, a 
paramagnetic ion, biotin, a chemiluminescent label or a 
label which may be detected through a secondary enzymatic 
or binding step. The secondary enzymatic or binding step 
may comprise the use of digoxigenin, alkaline phosphatase, 
horseradish peroxidase, S-galactosidase , fluorescein or 
steptavidin/biotin. Methods of labeling antibodies are 
well known in the art. 

Methods of recovering serum from a subject are well known 
to those skilled in the art. Methods of partially 
purifying antibodies are also well known to those skilled 
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in the art, and include, by way of example, filtration, 
ion exchange chromatography, and precipitation. 



The polyclonal and monoclonal antibodies of the invention 
may be labeled with a detectable marker. In one 

embodiment, the labeled antibody is a purified labeled 
antibody. The detectable marker may be, for example, a 
radioactive or fluorescent marker. Methods of labeling 
antibodies are well known in the art. 

Determining whether the polyclonal and monoclonal 
antibodies of the subject invention bind to cells, e.g. 
cancer cells, expressing an IRTA protein and form a 
complex with one or more of the IRTA protein (s) described 
herein, or fragments thereof, on the surface of said 
cells, may be accomplished according to methods well known 
to those skilled in the art. In the preferred embodiment, 
the determining is accomplished according to flow 
cytometry methods . 

The antibodies of the subject invention may be bound to an 
insoluble matrix such as that used in affinity 
chromatography. Cells which form a complex, i.e. bind, 
with the immobilized polyclonal or monoclonal antibody may 
be isolated by standard methods well known to those 
skilled in the art. For example, isolation may comprise 
affinity chromatography using immobilized antibody. 

Alternatively, the antibody may be a free antibody. In 
this case, isolation may comprise cell sorting using free, 
labeled primary or secondary antibodies. Such cell sorting 



methods are standard and are well known to those skilled 
in the art . 

This invention provides an antibody directed to a purified 
IRTA protein selected from the group consisting of IRTA1, 
IRTA2, IRTA3, IRTA4 and IRTA5 . In a preferred embodiment 
of the anti-IRTA antibody the IRTA protein is human IRTA 
protein. The IRTA protein may be any mammalian IRTA 
protein, including a murine IRTA protein. In a further 
embodiment of any the above-described antibodies, the 
antibody is a monoclonal antibody. In another embodiment, 
the monoclonal antibody is a murine monoclonal antibody or 
a humanized monoclonal antibody. As used herein, 
"humanized" means an antibody having characteristics of a 
human antibody, such antibody being non-naturally 
occurring, but created using hybridoma techniques wherein 
the antibody is of human origin except for the antigen 
determinant portion, which is murine. In yet another 
embodiment, the antibody is a polyoclonal antibody. 

In preferred embodiments, any of the antibodies of the 
subject invention may be conjugated to a therapeutic 
agent. In further preferred embodiments, the therapeutic 
agent is a radioisotope, toxin, toxoid, or 
chemotherapeutic agent. The conjugated antibodies of the 
subject invention may be administered to a subject having 
a B cell cancer in any of the methods provided below. 

This invention provides a pharmaceutical composition 
comprising an amount of the antibody directed to an IRTA 
protein effective to bind to cancer cells expressing an 
IRTA protein selected from the group consisting of human 



IRTA1, IRTA2, IRTA3 , IRTA4 and IRTA5 so as to prevent 
growth of the cancer cells and a pharmaceut ically 
acceptable carrier. The anti-IRTA antibody may be directed 
to an epitope of an IRTA protein selected from the group 
consisting of IRTA1, IRTA2 , IRTA3 , IRTA4 and IRTA5 . The 
IRTA proteins may be human or mouse IRTA proteins. 

In preferred embodiments of the above -described 
pharmaceutical composition, the cancer cells are selected 
from the group consisting of B cell lymphoma, multiple 
myeloma, a mantle cell lymphoma, Burkitt ' s lymphoma, 
marginal zone lymphoma, diffuse large cell lymphoma and 
follicular lymphoma cells. In another preferred 

embodiment of the pharmaceutical composition, the B cell 
lymphoma cells are Mucosa-Associated-Lymphoid Tissue B 
cell lymphoma (MALT) cells. In another preferred 
embodiment of the pharmaceutical composition, the B cell 
lymphoma cells are non-Hodgkin ' s lymphoma cells. 

This invention provides a pharmaceutical composition 
comprising an amount of any of the above-described 
oligonucleotides effective to prevent overexpression of a 
human IRTA protein and a pharmaceut ically acceptable 
carrier capable. In preferred embodiments of the 
pharmaceutical composition the oligonucleotide is a 
nucleic acid molecule which encodes an IRTA protein 
selected from the group consisting of IRTA1, IRTA2 , IRTA3 , 
IRTA4 and IRTA5 . The IRTA proteins may be human or mouse 
IRTA proteins . 



As used herein, "malignant" means capable of 
metastasizing. As used herein, "tumor cells" are cells 
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which originate from a tumor, i.e., from a new growth of 
different or abnormal tissue. The tumor cells and cancer 
cells may exist as part of the tumor mass, or may exist as 
free-floating cells detached from the tumor mass from 
which they originate. 

As used herein, malignant cells include, but are in no way 
limited to, B cell lymphoma, multiple myeloma, Burkitt 1 s 
lymphoma, mantle cell lymphoma, marginal zone lymphoma, 
diffuse large cell lymphoma and follicular lymphoma. The 
B cell lymphoma is Mucosa-Associated-Lymphoid Tissue B 
cell lymphoma (MALT) or is non-Hodgkin 1 s lymphoma. 

As used herein, "subject" is any animal or artificially 
modified animal. Artificially modified animals include, 
but are not limited to, SCID mice with human immune 
systems. In a preferred embodiment, the subject is a 
human . 

This invention provides a method of diagnosing B cell 
malignancy which comprises a lq21 chromosomal 
rearrangement in a sample from a subject which comprises: 
a) obtaining the sample from the subject; b) contacting 
the sample of step (a) with an antibody directed to a 
purified IRTA protein capable of specifically binding with 
a human IRTA protein selected from the group consisting of 
human IRTA1, IRTA2 , IRTA3 , IRTA4 and IRTA5 IRTA protein on 
a cell surface of a cancer cell under conditions 
permitting binding of the antibody with human IRTA protein 
on the cell surface of the cancer cell, wherein the 
antibody is labeled with a detectable marker; and c) 
detecting any binding in step (b) , wherein detecion of 
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binding indicates a diagnosis of B cell malignancy in the 
sample . 

In an embodiment of the above-described method of 
5 diagnosing B cell malignancy, the IRTA protein is selected 

from the group consisting of IRTA1 , IRTA2 , IRTA3 , IRTA4 
and IRTA5 . In another embodiment of the method the IRTA 
protein is human or mouse IRTA protein. In a further 
embodiment IRTA protein is purified. In a preferred 
10 embodiment of this method, the B cell malignancy is 

-3 selected from the group consisting of B cell lymphoma, 

multiple myeloma, Burkitt ' s lymphoma, marginal zone 
lymphoma, diffuse large cell lymphoma and follicular 
lymphoma. In yet another embodiment of this method, the B 
15 cell lymphoma is Mucosa-Associated-Lymphoid Tissue B cell 

lymphoma (MALT) . In another preferred embodiment of this 
method, the B cell lymphoma is non-Hodgkin ' s lymphoma. 

This invention provides a method of detecting human IRTA 
20 protein in a sample which comprises: a) contacting the 

sample with any of any of the above-described ant i- IRTA 
antibodies under conditions permitting the formation of a 
complex between the antibody and the IRTA in the sample ; 
and b) detecting the complex formed in step (a) , thereby 
25 detecting the presence of human IRTA in the sample. In an 

embodiement the IRTA protein detected may be an IRTA1 , 
IRTA2, IRTA3, IRTA4 or IRTA5 protein, having an amino acid 
sequence set forth in any of Figures 18A, 18B-1-18B-3, 
18C-1-18C-2, 18D-1-18D-2 or 18E-e-18E-2 . As described 
3 0 hereinabove detection of the complex formed may be 

achieved by using antibody labeled with a detectable 
marker and determining presence of labeled complex. 
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Detecting human IRTA protein in a sample from a subject is 
another method of diagnosing B cell malignancy in a 
subject. In an embodiment of this method of diagnosis, the 
B cell malignancy is selected from the group consisting of 
5 B cell lymphoma, multiple myeloma, Burkitt ' s lymphoma, 

marginal zone lymphoma, diffuse large cell lymphoma and 
follicular lymphoma. In yet another embodiment of this 
method, the B cell lymphoma is Mucosa-Associated-Lymphoid 
Tissue B cell lymphoma (MALT) . In another preferred 

10 embodiment of this method, the B cell lymphoma is non- 
- Hodgkin ' s lymphoma . 

This invention provides a method of treating a subject 
having a B cell cancer which comprises administering to 

11 the subject an amount of anti-IRTA antibody effective to 
bind to cancer cells expressing an IRTA protein so as to 
prevent growth of the cancer cells and a pharmaceutically 
acceptable carrier, thereby treating the subject. Growth 
and proliferation of the cancer cells is thereby inhibited 

20 amd the cancer cells die. In an embodiment of the above- 

described method, the IRTA protein is selected from the 
group consisting of human IRTA1, IRTA2 , IRTA3 , IRTA4 and 
IRTA5 . In a preferred embodiment of the above-described 
method of treating a subject having a B cell cancer, the 

25 anti-IRTA antibody is a monoclonal antibody. In another 

embodiment of the method, the monoclonal antibody is a 
murine monoclonal antibody or a humanized monoclonal 
antibody. The antibody may be a chimeric antibody. In a 
further embodiment, the anti-IRTA antibody is a 

30 polyoclonal antibody. In an embodiment, the polyclonal 

antibody may be a murine or human polyclonal antibody. In 
a preferred embodiment, the B cell cancer is selected from 



the group consisting of B cell lymphoma, multiple myeloma, 
Burkitt 1 s lymphoma, mantle cell lymphoma marginal zone 
lymphoma, diffuse large cell lymphoma and follicular 
lymphoma. In another preferred embodiment, the B cell 
lymphoma is Mucosa-Associated-Lymphoid Tissue B cell 
lymphoma (MALT) . In a further preferred embodiment, the B 
cell lymphoma is non-Hodgkin ' s lymphoma. In a preferred 
embodiment of the above-described method of treating a 
subject having a B cell cancer, administration of the 
amount of anti-IRTA antibody effective to bind to cancer 
cells expressing an IRTA protein is intravenous, 
intraperitoneal, intrathecal, intralymphat ical , 

intramuscular, intralesional , parenteral, epidural, 
subcutaneous; by infusion, liposome-mediated delivery, 
aerosol delivery; topical, oral, nasal, anal, ocular or 
otic delivery. In another preferred embodiment of the 
above-described methods, the anti-IRTA antibody may be 
conjugated to a therapeutic agent. In further preferred 
embodiments, the therapeutic agent is a radioisotope, 
toxin, toxoid, or chemotherapeutic agent. 

This invention provides a method of treating a subject 
having a B cell cancer which comprises administering to 
the subject an amount of an antisense oligonucleotide 
having a sequence capable of specifically hybridizing to 
an mRNA molecule encoding a human ITRA protein so as to 
prevent overexpression of the human IRTA protein, so as 
to arrest cell growth or induce cell death of cancer cells 
expressing IRTA protein (s) and a pharmaceut ically 
acceptable carrier, thereby treating the subject. 



In an embodiment of the above-described method of treating 
a subject having a B cell cancer, the IRTA protein is 
selected from the group consisting of human IRTA1 , IRTA2 , 
IRTA3 , IRTA4 and IRTA5 protein. In a preferred 

embodiment, B cell cancer is selected from the group 
consisting of B cell lymphoma, multiple myeloma, Burkitt ' s 
lymphoma, marginal zone lymphoma, diffuse large cell 
lymphoma and follicular lymphoma. In another preferred 
embodiment, the B cell lymphoma is Mucosa-Associated- 
Lymphoid Tissue B cell lymphoma (MALT) . In a yet another 
preferred embodiment, the B cell lymphoma is non-Hodgkin ' s 
lymphoma. In embodiments of any of the above-described 
oligonucleotides of nucleic acid molecules encoding the 
IRTA1 , IRTA2, IRTA3 , IRTA4 and/or IRTA 5 proteins, the 
nucleic acid may be genomic DNA or cDNA. In a further 
preferred embodiment of the above-described method of 
treating a subject having a B cell cancer, administration 
of the amount of oligonucleotide of effective to prevent 
overexpression of a human IRTA protein is intravenous, 
intraperitoneal, intrathecal, intralymphat ical , 

intramuscular, intralesional , parenteral, epidural, 
subcutaneous; by infusion, liposome-mediated delivery, 
aerosol delivery; topical, oral, nasal, anal, ocular or 
otic delivery. In an embodiment of the above-described 
methods, the oligonucleotide may be conjugated to a 
therapeutic agent. In further preferred embodiments, the 
therapeutic agent is a radioisotope, toxin, toxoid, or 
chemotherapeutic agent. 

The invention also provides a pharmaceutical composition 
comprising either an effective amount of the 
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oligonucleot ides or of the antibodies described above and 
a pharmaceutically acceptable carrier. In the subject 
invention an "effective amount" is any amount of an 
oligonucleotide or an antibody which, when administered to 
5 a subject suffering from a disease or abnormality against 

which the oligonucleotide or antibody are effective, 
causes reduction, remission, or regression of the disease 
or abnormality. In the practice of this invention the 
"pharmaceutically acceptable carrier" is any physiological 
10 carrier known to those of ordinary skill in the art useful 

in formulating pharmaceutical compositions. 

: = Pharmaceutically acceptable carriers are well known to 

those skilled in the art and include, but are not limited 
15 to, 0.01-0.1M and preferably 0 . 05M phosphate buffer or 

M= 0.8% saline. Additionally, such pharmaceutically 

acceptable carriers may be aqueous or non- aqueous 
solutions, suspensions, and emulsions. Examples of non- 
□ aqueous solvents are propylene glycol, polyethylene 

20 glycol, vegetable oils such as olive oil, and injectable 

organic esters such as ethyl oleate. Aqueous carriers 
include water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. 
Parenteral vehicles include sodium chloride solution, 
25 Ringer's dextrose, dextrose and sodium chloride, lactated 

Ringer's or fixed oils. Intravenous vehicles include 
fluid and nutrient replenishers , electrolyte replenishers 
such as those based on Ringer's dextrose, and the like. 
Preservatives and other additives may also be present, 
30 such as, for example, antimicrobials, antioxidants, 

chelating agents, inert gases and the like. 



In one preferred embodiment the pharmaceutical carrier may 
be a liquid and the pharmaceutical composition would be in 
the form of a solution. In another equally preferred 
embodiment, the pharmaceut ically acceptable carrier is a 
solid and the composition is in the form of a powder or 
tablet. In a further embodiment, the pharmaceutical 
carrier is a gel and the composition is in the form of a 
suppository or cream. In a further embodiment the 

compound may be formulated as a part of a pharmaceut ically 
acceptable transdermal patch. 

A solid carrier can include one or more substances which 
may also act as flavoring agents, lubricants, 
solubilizers, suspending agents, fillers, glidants, 
compression aids, binders or tablet -disintegrating agents; 
it can also be an encapsulating material. In powders, the 
carrier is a finely divided solid which is in admixture 
with the finely divided active ingredient. In tablets, 
the active ingredient is mixed with a carrier having the 
necessary compression properties in suitable proportions 
and compacted in the shape and size desired. The powders 
and tablets preferably contain up to 99% of the active 
ingredient. Suitable solid carriers include, for example, 
calcium phosphate, magnesium stearate, talc, sugars, 
lactose, dextrin, starch, gelatin, cellulose, 

polyvinylpyrrolidine, low melting waxes and ion exchange 
resins . 

Liquid carriers are used in preparing solutions, 
suspensions, emulsions, syrups, elixirs and pressurized 
compositions. The active ingredient can be dissolved or 
suspended in a pharmaceut ically acceptable liquid carrier 



such as water, an organic solvent, a mixture of both or 
pharmaceut ically acceptable oils or fats . The liquid 
carrier can contain other suitable pharmaceutical 
additives such as solubilizers , emulsifiers, buffers, 
preservatives, sweeteners, flavoring agents, suspending 
agents, thickening agents, colors, viscosity regulators, 
stabilizers or osmo- regulators . Suitable examples of 
liquid carriers for oral and parenteral administration 
include water (partially containing additives as above, 
e.g. cellulose derivatives, preferably sodium 

carboxymethyl cellulose solution) , alcohols (including 
monohydric alcohols and polyhydric alcohols, e.g. glycols) 
and their derivatives, and oils (e.g. fractionated coconut 
oil and arachis oil) . For parenteral administration, the 
carrier can also be an oily ester such as ethyl oleate and 
isopropyl myristate. Sterile liquid carriers are useful 
in sterile liquid form compositions for parenteral 
administration. The liquid carrier for pressurized 

compositions can be halogenated hydrocarbon or other 
pharmaceutically acceptable propellent. 

Liquid pharmaceutical compositions which are sterile 
solutions or suspensions can be utilized by for example, 
intramuscular, intrathecal, epidural, intraperitoneal or 
subcutaneous injection. Sterile solutions can also be 
administered intravenously. The compounds may be prepared 
as a sterile solid composition which may be dissolved or 
suspended at the time of administration using sterile 
water, saline, or other appropriate sterile injectable 
medium. Carriers are intended to include necessary and 
inert binders, suspending agents, lubricants, flavorants, 
sweeteners, preservatives, dyes, and coatings. 
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The pharmaceutical composition comprising the 
oligonucleotide or the antibody can be administered orally 
in the form of a sterile solution or suspension containing 
other solutes or suspending agents, for example, enough 
5 saline or glucose to make the solution isotonic, bile 

salts, acacia, gelatin, sorbitan monoleate, polysorbate 80 
(oleate esters of sorbitol and its anhydrides 
copolymerized with ethylene oxide) and the like. 

10 The pharmaceutical composition comprising the 

oligonucleotide or the antibody can also be administered 
orally either in liquid or solid composition form. 
Compositions suitable for oral administration include 
solid forms, such as pills, capsules, granules, tablets, 

1=1 and powders, and liquid forms, such as solutions, syrups, 

elixirs, and suspensions. Forms useful for parenteral 
administration include sterile solutions, emulsions, and 
suspensions . 

2 0 Optimal dosages to be administered may be determined by 

those skilled in the art, and will vary with the 
particular inhibitor in use, the strength of the 
preparation, the mode of administration, and the 
advancement of the disease condition or abnormality. 

25 Additional factors depending on the particular subject 

being treated will result in a need to adjust dosages, 
including subject age, weight, gender, diet, and time of 
administration . 



30 



This invention will be better understood from the 
Experimental Details which follow. However, one skilled 
in the art will readily appreciate that the specific 
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methods and results discussed are merely illustrative of 
the invention as described more fully in the claims which 
follow thereafter. 



EXPERIMENTAL DETAILS 

First Series of Experiments 



Molecular analysis of chromosomal translocations 
associated with multiple myeloma (MM) has indicated that 
the pathogenesis of this malignancy may be heterogeneous, 
being associated with several distinct oncogenes including 
BCL-1, MUM-1 and FGFR3 . Structural abnormalities of 

chromosome lq21, including translocations with chromosome 
14q32, represent frequent cytogenetic aberrations 
associated with multiple myeloma. In order to identify 
the genes involved in these translocations, the breakpoint 
regions corresponding to both derivatives of a 
t (1;14) (q21;q32) detectable in the FR4 human plasmacytoma 
cell line were cloned. Analysis of the breakpoint 
sequences showed that they involved a reciprocal 
recombination between the Immunoglobulin heavy chain (IgH) 
locus on 14q32 and unknown sequences on lq21. The normal 
locus corresponding ot hte lq21 region involved in the 
translocation was cloned and athe genes adjacent to the 
breakpoint region were identified by an exon- trapping 
strategy. Two genes were found, located within a 20 Kb 
distance from each other, in the region spanning the 
breakpoint on lq21. The first gene, called MUM-2 

(multiple myeloma-2) is expressed as a 2.5 Kb mRNA 
transcript detectable in spleen and lymph nodes. Cloning 
and sequencing of the full-length MUM- 2cDNA predicts a 515 
amino acid cell surface glycoprotein containing four 
extracellular Ig-type domains, a transmembrane and a 
cytoplasmic domain and sharing a 3 7% identity 
(51%homology) with Fc gamma receptor I over its first 
three extracellular domains. In FR4 cells, the 



translocation breakpoints interrupt the MUM-2 coding 
domain and juxtapose it to the IgH locus in the same 
transcriptional orientation. As a consequence, 

structurally abnormal FR4-specific MUM-2 transcripts (3.0, 
5.2 and 6.0 Kb) in lymph nodes and spleen and encodes a 
protein with an extracellular domain containing six Ig- 
type domains homologous to members of the Fc gamma and Ig- 
type adhesion receptor families. The structure of the 
MUM-2 and MUM- 3 genes and their direct involvement in a 
MM-associated translocation suggest that these genes code 
for novel cell surface receptors important for normal 
lymphocyte function and B cell malignancy. 

Second Series of Experiments 
Experimental Procedures 

Cell Lines 

The MM cell lines used in this study (FR4, XJ266 , JJN3, 
EJM, SKMM1, RPMI-822 6, XG1, XG2 , XG4 , XG6 , XG7) have been 
previously reported (Tagawa et al . , 1990), (Jernberg et 
al., 1987), (Hamilton et al . , 1990; Jackson et al . , 1989), 
(Eton et al., 1989), (Zhang et al . , 1994). The FR4 cell 
line was established in the laboratory of one of the 
authors (S.T) . The U266, JJN3 , and EJM cell lines were 
gifts from Dr. K. Nilsson (University of Uppsala, Uppsala, 
Sweden) and the SKMM-1 cell line was a gift of A.N. 
Houghton (Memorial Sloan Kettering Cancer Center, New 
York, NY) . The five XG cell lines were obtained from Dr. 
Bernard Klein and cultured in the presence of 1 ng/ml 
human recombinant IL-6 as described previously (Zhang et 
al., 1994). The BL cell lines with lq21 abnormalities 



have been previously described (Polito et al . , 1995), 
(Magrath et al . , 1980) and were grown in RPMI , 10% FCS . 

Genomic and cDNA library screening and DNA sequence 
analysis 

Two genomic libraries were constructed from FR4 genomic 
DNA either by BamHI complete digestion or by Sau3AI 
partial digestion and subsequent ligation of gel-purified 
fractions into the 1DASH-II phage vector (Stratagene) . The 
BamHI library was screened with a 4.2 kb XhoI-BamHI probe 
derived from the Ca locus and the Sau3AI library was 
screened with a 5 ' Sa probe previously described (Bergsagel 
et al., 1996). A human placental DNA library (Stratagene) 
was screened with probe 1 . OEH (Figures 8A-8C) to obtain 
the germline lq21 locus. Library screening and plaque 
isolation were preformed according to established 
procedures (Sambrook et al . , 1989). IRTA1 and IRTA2 cDNA 
clones were isolated from an oligo-dT/random-primed cDNA 
library constructed from normal human spleen RNA 
(Clontech) . The IRTA1 cDNA probe used for library 

screening was obtained from RT-PCR of human spleen cDNA 
using primers flanking exons 1 and 3. DNA sequencing was 
preformed on an ABI 373 automated sequencer (Applied 
Biosystems) . Sequence homology searches were carried out 
through the BLAST e-mail server at the National Center for 
Biotechnology Information, Bethesda, MD . 

PAC and YAC isolation and exon trapping 

Human PAC clones were obtained by screening a human PAC 
library spotted onto nylon membranes (Research Genetics) , 
with the 1.0 EH probe (Figures 8A-8C) . The Zeneca 



(formerly ICI) human YAC library (Anand et al . , 1990) 
obtained from the United Kingdom Human Genome Mapping 
Resource Center (UK-HGMP) was screened using a PCR-based 
pooling strategy. Exon trapping was performed using the 
exon trapping system (Gibco BRL) , according to the 
manufacturer' s instructions . 

Isolation of PAC/YAC end clones, pulsed- field gel 
electrophoresis (PFGE) and fluorescence in situ 
hybridization (PISH) analysis 

PAC DNA extraction was performed according to standard 
alkaline lysis methods (Drakopoli N et al . , 1996). A 
vectorette-PCR method was used to isolate PAC and YAC end 
probes (Riley et al . , 1990), as previously described (Iida 
et al, 1996) . PFGE analysis was performed according to 
standard protocols (Drakopoli N et al., 1996) using the 
CHEF Mapper system (BioRad, Hercules, CA) . Biotin 
labeling of PAC DNA, chromosome preparation and FISH were 
performed as previously described (Rao et al . , 1993). 

Southern and Northern blot analyses, RACE and RT-PCR 

Southern and northern blot analyses were performed as 
described previously (Neri et al, 1991) . For Northern 
blot analyses total RNA was prepared by the guanidium 
thiocyanate method and poly (A) RNA was selected using 
poly (T) -coated beads (Oligotex Kit by Quigen) . For 
Northern blots, 2 mg of poly (A) RNA were loaded per lane. 
Multiple tissue Northern filters were obtained from 
Clontech. RACE was performed using the Marathon cDNA 
Amplification kit (Clontech) and Marathon -Ready spleen 



cDNA. First strand cDNA synthesis was performed using the 
Superscript RT-PCR system (Gibco BRL) 



In situ hybridization 

Digoxigenin-containing antisense and sense cRNA probes 
were transcribed with T3 and T7 RNA polymerase, 
respectively, from linearized pBluescript KS+ plasmids 
containing coding region of cDNAs (nucleotides 62 to 1681 
of IRTA1 and 18 to 2996 of IRTA2 . ) Hyperplastic human 
tonsillar tissue surgically resected from children in 
Babies' Hospital, Columbia Presbyterian Medical Center was 
snap frozen in powdered dry ice. Cryostat sections were 
stored for several days at -80 degrees C prior to 
processing. Non-radioactive in situ hybridization was 
performed essentially as described (Frank et al . , 1999), 
except that fixation time in 4% paraformaldehyde was 
increased to 2 0 minutes, and proteinase K treatment was 
omitted. The stringency of hybridization was 6 8 degrees 
C, in 5X SSC, 5 0% f ormamide . Alkaline phosphatase- 
conjugated ant i -digoxigenin antibody staining was 
developed with BCIP/NBT substrate. 

Transf ection, immunoprecipitation and Western Blotting 

2 93 cells (ATCC) , grown in DMEM, 10% FCS were transiently 
transfected, according to the standard calcium phosphate 
method, with pMT2T and pMT2T- IRTAl/Ca transient expression 
constructs. The latter was generated using the 

IRTAl/Ca RT-PCR product from FR4 . Cells (2x10 s of 

transfectants and 2xl0 7 of remaining cell lines) were 
solubilized in Triton X-100 lysis buffer (150 mM NaCl, 10 
mM Tris-HCl [pH 7.4], 1% Tx-100, 0.1% BSA) in the presence 
of a protease inhibitors coctail (Roche Biochemicals ) . 
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Lysates were incubated at 4°C for 2 hours with 4 mg/ml of 
the monoclonal antibody #117-332-1 (Yu et al . , 1990) 
(Tanox Biosystems, Inc, Houston, Texas) that was raised 
against the extracellular portion of the IgA membrane 
peptide. Immune complexes were isolated with protein G- 
Sepharose (Pharmacia) prior to electrophoresis on 10-20% 
Tris-HCl gradient gels (Biorad) and immunobloting, using 
15 mg/ml of the #117-332-1 antibody. Results were 

visualized by ECL (Amersham) . 

RESULTS 

Molecular Cloning of the t(l;14) (q21;q32) 

Chromosomal translocations involving the Ig heavy-chain 
(IGH) locus often occur within or near IgH switch regions 
as a result of "illegitimate" switch recombination events 
(Dalla-Favera et al . , 1983 ; Chesi et al . , 1996; Chesi et 
al., 1998) . The' breakpoints can be detected by Southern- 
blot hybridization assays as rearranged alleles in which 
the IGH constant (C H ) region sequences have lost their 
syntenic association with IGH joining (J H ) and 5' switch 
region (S) sequences (Dalla-Favera et al . , 1983; Neri et 
al., 1988; Neri et al . , 1991; Bergsagel et al . , 1996). 
This assay has led to the identification of several 
chromosomal partners for the IgH locus in B - NHL and MM 
(Taub et al . , 1982; Dalla-Favera et al . , 1983 ; Neri et 
al-, 1988; Neri et al . , 1991; Ye et al . , 1993 ; Chesi et 
al., 1996; Richelda et al., 1997; Iida et al . , 1997; 
Dyomin et al . , 1997; Dyomin et al . , 2000). We employed 
the same strategy in order to clone the lq21 breakpoint 
region in FR4 , a myeloma cell line carrying a 
t(l;14) (q21;q32), as determined by cytogenetic analysis 



(Tagawa et al . , 1990; Taniwaki M, unpublished results) . 
Two "illegitimately" rearranged fragments were identified 
within the Ca heavy- chain locus in FR4 by Southern blot 
hybridization analysis (data not shown) , and were cloned 
from phage libraries constructed from FR4 genomic DNA. 
Restriction mapping, Southern blot hybridization and 
partial nucleotide sequencing of two genomic phages 
(clones A FR4B-5 and X FR4S-a, Figure 8A) demonstrated that 
they contained the chromosomal breakpoints of a reciprocal 
balanced translocation between the Ccc 1 locus on 14q32 and 
non-IGH sequences. A probe (1.0EH) representing these 
non-IgH sequences (Figure 8A) was then used to clone the 
corresponding normal genomic locus from phage, PI 
artificial chromosome (PAC) , and yeast artificial 
chromosome (YAC) human genomic libraries. Fluorescence in 
situ hybridization (FISH) analysis of normal human 
metaphase spreads using the 100 -kb non-chimaeric PAC clone 
4 9A16 which spans the breakpoint region (see below, Figure 
13) , identified the partner chromosomal locus as derived 
from band lq21 (Figure 8C) . Mapping to a single locus 
within chromosome 1 was confirmed by hybridization of two 
non-repetitive probes to DNA from a somatic-cell hybrid 
panel representative of individual human chromosomes (data 
not shown) . These results were consistent with the 
cloning of sequences spanning the t ( 1 ; 14 ) (q21 ; q32 ) in FR4 . 

Sequence analysis of the breakpoint regions on the 
derivative chromosomes and alignment with the germline 
14q32 and lq21 loci revealed that the breakpoint had 
occurred in the intron between the CH3 and the 



transmembrane exon of Co^ on chromosome 14 . Although the 
breakpoint region was devoid of recombination signal 
sequences (RSS) or switch signal sequences (Kuppers et 
al., 1999), the sequence CTTAAC (underlined on Figure 8B) 
was present in both germline chromosomes 14 and 1 at the 
breakpoint junction. One copy of this sequence was 
present in each of the derivative chromosomes, with a 
slight modification in the der(l) copy (point mutation in 
the last nucleotide: C to G) . The nucleotides AT 

preceding CTTAAC on chromosome 1 were also present in both 
derivative chromosomes (Figure 8B) . The translocation did 
not result in any loss of chromosome 1 sequences . On the 
other hand, in the chromosome 14 portion of der(l) we 
observed two deletions upstream of the breakpoint 
junction: a 16 nucleotide deletion (GGCACCTCCCCTTAAC) and 
a 4 nucleotide deletion (TGCA) 6 nucleotides upstream 
(Figure 8B) . These observations indicate that the 

t (1;14) (q21;q32) in FR4 cells represents a balanced 
reciprocal translocation possibly facilitated by the 
presence of homologous sequences (CTTAAC) on both 
chromosomes . 

The lq21 breakpoint region contains genes coding for novel 
members of the Immunoglobulin Receptor Superfamily 

We next investigated whether the region of chromosome lq21 
spanning the translocation breakpoint in FR4 contains a 
transcriptional unit. DNA from partially overlapping PAC 
clones 49A16 and 210K22 (Figure 13) was "shotgun" cloned in 
plasmids, sequenced and analyzed for homology to known 
genes in human genome databases. In parallel, candidate 



genes on the 49A16 PAC were sought by an exon trapping 
strategy (Church et al., 1994). 



Mapping of the candidate exons on the lq21 genomic clones 
revealed that the FR4 breakpoint had occurred between two 
trapped exons (see below, Figure 13) , which belonged to 
the same transcript since they could be linked by RT-PCR 
using spleen RNA. This RT-PCR product was then used as a 
probe to screen a spleen cDNA library in order to isolate 
full-length clones corresponding to this transcript. Two 
sets of cDNA clones were identified, belonging to two 
distinct transcripts and sharing a 76% mRNA sequence 
identity within the 443 bp probe region. Full length cDNA 
clones for both transcripts were obtained by rapid 
amplification of cDNA ends (RACE) on human spleen cDNA 
that generated 5' and 3' extension products. 

The schematic structure of the cDNA representing the first 
transcript is depicted in Figure 9A. Alternate usage of 
three potential polyadenylat ion sites in its 3 ' 
untranslated region gives rise to three mRNA species of 
2.6, 2.7 and 3.5 kb, encoding the same putative 515 -amino 
acid protein (Figure 9A) . The predicted features of this 
protein include a signal peptide, in accordance with the 
[-3, -1] rule (von Heijne, 1986), four extracellular Ig- 
type domains carrying three potential asparagine (N) - 
linked glycosylat ion sites (Figure 9A) , a 16 amino acid 
transmembrane and a 106 amino acid cytoplasmic domain with 
three putative consensus Src-homology 2 (SH2 ) -binding 
domains (Unkeless and Jin, 1997) (Figure 10B) . These 
(SH2) -binding domains exhibit features of both IT AM 
(immune -receptor Tyrosine-based Activation Motif 



-D/EX 7 D/EX 2 YXXL/IX S . 8 YXXL/I ; where X denotes non-conserved 
residues) (Reth, 1989) and I TIM motifs (immune-receptor 
Tyrosine-based inhibition Motif - S/V/L/IYXXL/V where X 
denotes non-conserved residues) (Unkeless and Jin, 1997) . 
As shown in Figure 10B, the first two SH2 -binding domains 
are spaced 8 aminoacids apart, consistent with the 
consensus ITAM motif. Diverging from the consensus, the 
glutamate residue <E) is positioned four rather than two 
aminoacids before the first tyrosine (Y) (Figure 10B) , and 
the +3 position relative to tyrosine (Y) is occupied by 
valine (V) rather than leucine (L) or isoleucine (I) 
(Cambier, 1995) . All three domains conform to the ITIM 
consensus and each is encoded by a separate exon, as is 
the case for ITIM. Thus their arrangement may give rise 
to three ITIM or possibly to one ITAM and one ITIM. The 
overall structure of this protein suggests that it 
represents a novel transmembrane receptor of the Ig 
super family and it was therefore name IRTA1 (immune 
Receptor Translocation Associated gene 1) . 

The second cDNA shares homology to IRTA1 (6 8% nucleotide 
identity for the length of the IRTA1 message encoding its 
extracellular domain) and was named IRTA2 . The IRTA2 
locus is more complex than IRTA1 and is transcribed into 
three major mRNA isoforms {IRTA2a, IRTA2b, IRTA2c) of 
different molecular weight (2.8, 4.7 and 5.4 kb 
respectively), each with its own unique 3' untranslated 
region (Figure 9B) . In addition, a 0.6 kb transcript 
(Figure 12A) arises from the usage of an early 
polyadenylation signal at nucleotide 536 of IRTA2 . The 
three predicted IRTA2 protein isoforms encoded by these 
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transcripts share a common aminoacid sequence until 
residue 560, featuring a common signal peptide and six 
extracellular Ig-type domains (Figure 9B) . IRTA2a encodes 
for a 759 aa secreted glycoprotein with eight Ig-type 
5 domains followed by 13 unique, predominantly polar 

aminoacids at its C-terminus. IRTA2b diverges from IRTA2a 
at amino acid residue 560, and extends for a short stretch 
of 32 additional residues, whose hydrophobicity is 
compatible with its docking to the plasma membrane via a 
10 GPI -anchor (Ferguson and Williams, 1988) . IRTA2c is the 

O longest isoform whose sequence deviates from IRTA2a at 

\i aminoacid 746. It encodes a 977 aa type I transmembrane 

Z glycoprotein with nine extracellular Ig-type domains, 

:" harboring eight potential N- linked glycosylation sites, a 

15* 2 3 aminoacid transmembrane and a 104 aminoacid cytoplasmic 

domain with three consensus SH2 -binding motifs (Figure 
10B) . Each of the SH2 -binding sites in IRTA2c agrees with 
the I TIM consensus (Figure 10B) and is encoded by a 
p{ separate exon. These features suggest that IRTA2c is a 

2 0 novel transmembrane receptor of the Ig superfamily with 

secreted and GPI -linked isoforms. 

Homology between the IRTA proteins and Immunoglobulin 
Superfamily Receptors 

25 

Amino acid alignment of the entire extracellular domains 
of the IRTA1 and IRTA2 proteins to each other and to other 
Ig superfamily members revealed a remarkable homology 
between them (47% identity and 51% similarity) and a 

3 0 lower, but striking homology to the Fc gamma receptor 

family of proteins. This homology was stronger in the 
aminoacid positions conserved among the different classes 



of Fc receptors. Among Fc receptors, the high affinity 
IgG receptor FCGRI (CD64) shared the highest levels of 
homology with the first three Ig-domains of IRTA1 and 
IRTA2 (37% identity and 50% similarity) throughout its 
entire extracellular portion (Figure 10A) . Lower levels 
of homology were observed between the IRTA proteins and 
the extracellular domains of other cell surface molecules, 
including human platelet endothelial cell adhesion 
molecule (PECAM1) , B-lymphocyte cell adhesion molecule 
(CD22) and Biliary Glycoprotein 1 (BGP1) (22-25% identity, 
38-41% homology) . 

No homology is apparent between the IRTAs and members of 
the Fc receptor family in their cytoplasmic domains. In 
contrast, significant aminoacid homology is present 
between IRTA1 and PECAM1 (31% aminoacid identity and 45% 
homology) , IRTA2c and BGP1 (30% identity, 35% homology) 
and IRTA2c and PECAM1 (28% identity, 50% homology) (Figure 
10B) . These homologies suggest employment of similar 
downstream signaling pathways by these different proteins. 

IRTA1 and IRTA2 are normally expressed in specific 
subpopulations of B cells 

The normal expression pattern of the IRTA1 and IRTA2 mRNAs 
was first analyzed by Northern blot hybridization of RNA 
derived from different normal human tissues and from human 
cell lines representing different hematopoietic lineages 
and stages of B-cell development. 



IRTA1 expression was detected at a very low level in human 
spleen and lymph node RNA {Figure 11A, left panel) and was 
undetectable in all other human tissues analyzed, 
including fetal liver, bone marrow, lung, placenta, small 
intestine, kidney, liver, colon, skeletal muscle, heart 
and brain (data not shown) . Among B cell lines, IRTA1 
expression was absent in cell lines representing pre-B and 
germinal center B-cells, plasma cells and cells of 
erythroid, T-cell and myeloid origin (data not shown, see 
Materials and Methods) . Expression was detectable at very 
low levels only in EBV- immortalized lymphoblastoid cell 
lines (LCD , which represent a subpopulat ion 
(immunoblasts) positioned downstream of germinal center B 
cells in B-cell differentiation. However, expression was 
induced in estrogen-deprived ER/EB cells which, being 
immortalized by a recombinant EBV genome in which the 
EBNA2 gene is fused to the estrogen receptor, proliferate 
in the presence of estrogen while they arrest in the G 0 /G 1 
phase upon estrogen deprivation (Kempkes et al . , 1995). 
IRTA1 expression was barely detectable in these cells in 
the presence of estrogen, but was induced (10-fold) upon 
their G 0 /G ± arrest following estrogen withdrawal (Figure 
11A, right panel) . Taken together, these results suggest 
that IRTA1 is expressed in a lymphoid subpopulat ion 
present in spleen and lymph nodes and presumably 
represented by resting B cells. 

To further investigate the phenotype and tissue 
distribution of the cells expressing IRTA1 , we performed 
in situ hybridization on human tonsillar tissue using a 
IRTA1 antisense cDNA probe (Figure 11B) . Serial sections 
were processed for in situ hybridization with a control 



-70- 

sense cDNA probe (Panel # 1 in Figure 11B) , an antisense 
cDNA probe (Panel # 2) and hematoxylin and eosin (H&E) 
staining (Panel #3) to outline the architecture of the 
lymphoid tissue. The IRTA1 hybridization signal was 
excluded from the germinal center and the mantle zone of 
the follicles and was characteristically concentrated in 
the perifollicular zone with infiltrations in the intra- 
epithelial region (Figures 11B-2, 11B-4). In this region, 
only B cells were positive as documented by staining with 
B cell specific markers (IgD, not shown) , and by 
immnunohistochemical analysis with anti-IRTAl and anti-B 
(CD20, PAX5) , anti-T (CD 3 ) , and ant i -monocyte (CD68) 
antibodies (not shown; G. Cattoretti et al . , manuscript in 
preparation) . This perifollicular area is the "marginal 
zone" equivalent of the tonsil, representing a functionally 
distinct B-cell compartment that contains mostly memory B- 
cells and monocytoid B-cells (de Wolf-Peeters et al . , 
1997) . Together with the Northern blot analysis of normal 
tissues and cell lines, these results indicate that IRTA1 
is expressed in a subpopulation of resting mature B-cells 
topographically located in the perifollicular and 
intraepithelial region, sites rich in memory B cells. 

In the case of IRTA2 , Northern blot analysis detected all 
alternatively spliced species in human lymph node, spleen, 
bone marrow and small intestine mRNA, with relative 
preponderance of the IRTA2a isoform (Figure 12A, left 
panel) . Among the hematopoietic cell lines of lymphoid 
and non- lymphoid origin tested, IRTA2 expression was 
restricted to B-cell lines with an immunoblast ic , post- 
germinal center phenotype (Figure 12A, right panel) . 
Similarly to IRTA1 , it was absent from cell lines derived 



from pre-B cells, germinal center centroblasts , plasma 
cells, T-cells, erythroid cells and myeloid cells (Figure 
12A, right panel) . 

In situ hybridization analysis of human tonsillar tissue, 
using the IRTA2c cDNA as a probe, was consistent with the 
results of the Northern blot analysis. The IRTA2 mRNA was 
largely excluded from the mantle zone of the germinal 
center, with the exception of a few positive cells 
(Figures 12B-2, 12B4) . Within the germinal center, the 
dark zone, represented by centroblasts, appeared negative 
for IRTA2, while the light zone, rich in centrocytes, was 
strongly positive (Figures 12B-2, 12B-4) . Finally, IRTA2 
mRNA was detected in the "marginal zone" equivalent region 
outside germinal center follicles and in the 
intraepithelial and interf ollicular regions of the tonsil. 
This pattern is consistent with specificity of IRTA2 for 
centrocytes and post-germinal center B cells. Comparing 
their expression patterns, we conclude that both are 
specific for mature B cells, but IRTA2 has a broader 
pattern of expression that includes centrocytes and 
interf ollicular B cells, while IRTA1 is restricted to 
marginal zone B cells, most likely memory cells. 

Genomic organization of the IRTA1 and IRTA2 genes 

To understand the consequences of lq21 abnormalities on 
IRTA1 and IRTA2 gene structure and expression, we first 
determined the organization of their genomic loci . The 
IRTA1 gene contains 11 exons with a total genomic size of 
24.5 kb (Figure 13). The IRTA2 locus was found to span a 
genomic region of approximately 40 kb (Figure 13) . The 
three IRTA2 alternatively spliced products share their 



first 8 exons, at which point IRTA2b does not utilize the 
next splicing site, and terminates by entering its 3 ' UTR 
region. IRTA2a and 2c isoforms splice into exon 9, with 
IRTA2a entering into its 3 ' UTR after exon 11 and IRTA2c 
splicing into exon 12 and extending until exon 18 (Figure 
13) . 

Based on sequencing data, we determined that the IRTA1 and 
IRTA2 genes are located 21 kb distant from each other, 
juxtaposed in the same transcriptional orientation (Figure 
13) that extends from the telomere (5') towards the 
centromere (3'). At the lq21 locus, they are tightly 
linked to each other as well as to three additional genes 
we recently cloned through their homology to the IRTAs 
(I.M, manuscript in preparation). All five genes are 
contiguous, covering a -3 0 0 kb region at lq21 . This 
region is located at the interval between previously 
reported lq21 breakpoints. Based on the distance between 
genomic clones harboring the respective genes on the 
Whitehead Institute Radiation Hybrid map, the IRTA1-2 
locus is estimated to lie approximately 0.8 Mb away from 
the MUC1 locus towards the telomere (N.P, unpublished 
data; Dyomin et al . , 2000; Gilles et al . , 2000) and less 
than or equal to 7 Mb away from the FCGRIIB locus towards 
the centromere (N.P, unpublished data) . 

The t(l;14) (q21;q32) translocation generates an IRTAl/Cai 
fusion protein in the FR4 myeloma cell line 

Comparative restriction and nucleotide sequence analysis 
of germline versus rearranged sequences from the Ca x and 
IRTA1 loci showed that the translocation had fused 



sequences within intron 2 of the IRTA1 gene to the 
intronic sequences between the CH3 and the transmembrane 
exon of Ca 1 in the same transcriptional orientation (Figure 
14A) . This suggested that, if IRTA1 sequences were 
expressed in the translocated locus, the intact donor site 
at the 3 ' border of the IRTA1 exon and the intact acceptor 
site at the 5' of Ca 1 could be used to generate a fusion 
IR TA1 / Ca 2 mRNA , and possibly a IRTAl/Caj. fusion protein. 

In order to test this prediction, we analyzed IRTA1 mRNA 
expression in FR4 by Northern blot analysis using an IRTA1 
cDNA probe derived from exon 1 (Figure 14A) . This probe 
detected a 0.8 kb message in FR4 that was absent from 
other B-cell lines, and was shorter than the normal 2.5 kb 
message detectable in ER/EB cells (Figure 14B) . We cloned 
this transcript by RT-PCR of FR4 mRNA using primers 
derived from sequences at the 5' border of IRTA1 exon 1 
and the 3' border of the cytoplasmic exon (Figure 14A) . 
An RT-PCR product was obtained from FR4 , but not from the 
DAKIKI cell line expressing wild-type surface IgA, or 
other cell lines lacking a t (1,-14) translocation (data not 
shown) . Direct sequencing analysis of the PCR product 
indicated that splicing had precisely linked IRTA1 and 
Ca 2 at canonical splicing sites and determined that the 
fusion transcript was 820 bp long. 

Analysis of the predicted protein product indicated that 
the IRTAl/Ca?! splicing had resulted in a fusion between the 
IRTA1 signal peptide and first two extracellular 
aminoacids, with the 32 -amino acid long extracellular 
spacer, transmembrane domain and cytoplasmic tail of the 



membrane IgA-L (mlgAj receptor (Figure 14C) . To assay for 
the expression of this fusion protein in FR4 protein 
extracts, we used an antibody directed against 
extracellular aminoacid residues specific for the 
transmembrane isoform of Co^ (Yu et al . , 1990) for 
immunoprecipitation, followed by Western blotting. Our 
results demonstrated that FR4 cells, but not a control 
cell line (DAKIKI) expressing wild-type surface IgA, 
express a 9 . 8kDa protein consistent with the predicted 
size of IRTA1/CCC! fusion protein (Figure 14D) . These 
results show that the translocated allele encodes a fusion 
protein, composed of the signal peptide and first two 
extracellular residues of IRTA1 (17 aminoacids) fused to 
the Ca 1 encoded transmembrane and cytoplasmic domains (71 
aminoacids) . In contrast to IRTA1/Ca 1 overexpression on 
der(14), no expression was detected in FR4 for the 
reciprocal Ca z /IRTA1 transcript or for the intact IRTA2 
gene on der (1) . 

With the exception of FR4 , IRTA1 mRNA expression was not 
detected in any other myeloma or lymphoma cell line, 
regardless of the status of its chromosomal band lq21 
(data not shown) . Thus, the IRTAl/Ca fusion represents a 
rare event in lq21 aberrations. 

Frequent deregulation of IRTA2 expression in cell lines 
carrying lq21 abnormalities 

In order to establish the physical relationship between 
other lq21 breakpoints and the IRTAl/2 locus, we performed 
FISH analysis with the PAC 4 9A16 on our panel of BL and MM 
cell lines. Among ten BL cell lines analyzed, seven with 



dup(l) (q21q32) and three with lq21 translocations (AS283A, 
BL104, BL136), we detected three signals corresponding to 
the IRTA1/IRTA2 locus in seven of the former and two of 
the latter, consistent with dup (1) (q21q32 ) in the first 
case and dup(l) (q21q32) followed by a translocation 
breakpoint at lq21 in the second. (Table 1) . FISH 
analysis of AS283A and BL136, using probes spanning the 
IRTA locus and with neighboring genomic clones, placed the 
breakpoint of the derivative chromosomes outside the IRTA 
locus in both cell lines, at a distance of >800 kb towards 
the centromere in AS283A and >800 kb towards the telomere 
in BL136 (N.P, unpublished results). Consistent with this 
finding, analysis of 30 cases of MM primary tumors by 
interphase FISH with the 300-kb YAC 23GC4 (Figure 13) , 
showed that 15 cases (50% of total analyzed) had more than 
two interphase FISH signals (data not shown) , while double 
color FISH with two PAC clones flanking the YAC 
centromeric and telomeric borders detected no split of 
these two probes in any of the cases. These results 
indicate that, with the exception of FR4 , the breakpoints 
of lq21 aberrations in BL or MM are not within or in close 
proximity to the genomic region defined by IRTA1 and 
IRTA2 . However, the consistent outcome of either 

dup(l) (q21q32) (see Table 1) or dup(l) (q21q32) followed by 
unbalanced translocations (AS283A, BL136, XG2 , XG7 in 
Table 1) is partial trisomy or tetrasomy of the region of 
lq21 containing the IRTA genes. 
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We then investigated whether these aberrations had an 
effect on IRTA2 mRNA expression. To this end, we used a 
cDNA probe corresponding to the IRTA2 5' untranslated 
region to screen a Northern blot with a panel of B -NHL and 
MM cell lines lacking or displaying lq21 chromosomal 
abnormalities. The results show that most (ten out of 
twelve) BL lines with normal lq21 chromosomes essentially 
lack IRTA2 expression, consistent with the fact that BL 
derive from GC centroblasts which normally lack IRTA2 
expression {Figure 15A, left panel) . In contrast, most BL 
lines carrying lq21 abnormalities (ten out of twelve) 
clearly display IRTA2 mRNA upregulation (Figure 15A, right 
panel) , ranging from 2 to 50 fold over baseline levels 
detected in BL with normal lq21. Among myeloma cell 
lines, IRTA2 was overexpressed in one out of three lines 
displaying lq21 abnormalities (XG2) , while it was 
expressed in none out of seven with normal lq21 (Figure 
15B) . 

These results show a strong correlation between the 
presence of lq21 chromosomal aberrations and deregulation 
of IRTA2 mRNA expression in BL and suggest that trisomies 
of the IRTA2 locus may deregulate its expression in this 
lymphoma subtype (see Discussion) . 

Discussion 

Efforts described herein to identify genes involved in 
chromosomal aberrations affecting band lq21 in Multiple 
Myeloma and B cell lymphoma, led to the discovery of IRTA1 
and IRTA2, two founding members of a novel subfamily of 
related receptors within the immunoreceptor family; full 
length nucleic acid sequences encoding IRTA1 and IRTA2 



proteins are provided herein, as are the amino acid 
sequences of the encoded IRTA1 and IRTA2 proteins. 
Subsequently three additional genes of members of this 
subfamily of related receptors were isolated, IRTA3 , 
IRTA4, and IRTA5 , the full length nucleic acid sequences 
of which are provided herein, as are the amino acid 
sequences of the encoded IRTA3 , IRTA4 , and IRTA5 proteins. 
These results have implications for the normal biology of 
B cells as well as for the role of lq21 aberrations in 
lymphomagenesis . 

IRTA1 and IRTA2 are founding members of a new subfamily 
within the Ig super family 

Several features shared between the two IRTA genes and 
their encoded proteins suggest that they form a new 
subfamily within the immunoreceptor superfamily. First, 
they share a higher degree of homology with each other in 
their extracellular domains than with other superfamily 
members both in their mRNA (68% identity) and protein (47% 
identity) sequence. Second, they share homology in their 
cytoplasmic domains, marked by the presence of ITAM-like 
and ITIM signaling motifs in the context of homologous 
aminoacid sequences. Third, IRTA1 and IRTA2 belong to a 
larger subfamily of five genes displaying higher 
intrafamily homology and tight clustering within a -3 00 kb 
region at lq21 (I.M. et al . , manuscript in preparation). 
Their genomic organization suggests that a common 
ancestral gene may have given rise to this subfamily, by a 
process of duplication and sequence divergence, similar to 
the mechanism proposed for the Fc receptor family (Qiu et 
al . , 1990) . 

In their extracellular domain, the IRTA proteins are 
closely related to the Fc receptor subfamily based on the 
high degree of aminoacid homology shared especially with 
the high affinity FCGRI receptor (37-45% aminoacid 



identity) . A common evolutionary origin with Fc receptors 
is also suggested by the position of the IRTA family locus 
in the interval between the FCGRI locus on lq21 and the 
FCERI and FCGRII-III loci on Iq21-q23 . Finally, the IRTA 
and FCR genes share a similar exon/intron organization of 
the gene portion that encodes their signal peptide, in 
particular the two 5' leader exons with the sequences 
encoding the signal peptidase site located within the 
second 21 -bp exon. 

Based on their cytoplasmic ITIM-like motifs, the 
IRTAproteins can be considered members of the Inhibitory 
Receptor Superfamily (IRS) , a group of receptors that 
block activation of many cell types in the immune system 
(Lanier, 1998) . Such members include FCGRI IB and CD22 in 
the human (DeLisser et al . , 1994) and PIR-B in the mouse 
(Kubagawa et al . , 19 97). Analogous to IRS members, the 
I TIM of IRTA1 and IRTA2 are encoded by individual exons. 
A feature that many IRS members share is the existence of 
corresponding activating receptor isoforms whose 
cytoplasmic domains are devoid of ITIM (reviewed in 
Ravetch and Lanier, 1998) . It is possible that the 
secreted isoform of IRTA2 , which lacks ITIM-like motifs, 
fulfills an analogous role by counteracting the effect of 
the transmembrane isoform. 

Significant homology in the sequence and overall 
organization of their extracellular portion is shared 
among the IRTA1 and IRTA2 proteins and the Cell Adhesion 
Molecule (CAM) subfamily members PECAM1 , CD22 and BGP1. 
In addition, the ability of IRTA2 to generate three 
protein isoforms with distinct subcellular localization (a 
transmembrane, a GPI- linked or a secreted protein) by 
differential splicing is shared by NCAM, another member of 
the CAM subfamily (Dickson et al . , 1987; Gower et al . , 
1988) . Thus, the IRTA family is also related to the CAM 
family, as has been previously suggested for a member of 
the Fc receptor family (murine FCGRI I) because of its 



homology to PECAM1 (CAM, IRS family) (Daeron, 1991; Newman 
et al., 1990; Stockinger et al . , 1990). 



In conclusion, the IRTA family may represent an 
intersection among the Fc, IRS and CAM families, combining 
features from all three. Accordingly, IRTA proteins may 
have a role in the regulation of signal transduction 
during an immune response (like Fc receptors) , 
intercellular communication (like members of the IRS and 
CAM families) and cell migration {like CAM family members) 
(DeLisser et al . , 1994; Ravetch and Lanier, 2000). 
Initial experiments indicate that IRTA1 can weakly bind 
heat aggregated IgA, while IRTA2c can specifically bind 
heat aggregated human serum IgG (with higher affinity for 
IgGj. and IgG 2 ) , but not monomeric human IgG, IgA, IgM and 
IgE (data not shown) . These initial data lend support to 
a functional relationship between the IRTA and the Fc 
receptor families, but do not exclude functions dependent 
on other ligands for the IRTA proteins . 



Differential pattern of expression of IRTA genes in mature 
B cells 

The IRTA genes display a specific pattern of expression in 
various normal B cell compartments. IRTA1 is 

topographically restricted to B cells within the 
perifollicular region, which was originally named marginal 
zone in the spleen, but is also detectable in most 
lymphoid organs (de Wolf-Peeters et al., 1997). The in 
situ hybridization data presented here have been confirmed 
by immunohistochemical analysis using anti-IRTAl 
antibodies which show that the IRTA1 protein is 
selectively expressed in marginal zone B cells, and, among 
NHL, in marginal zone lymphoma, the tumors deriving from 
these cells (G. Cattoretti et al . , manuscript in 
preparation) . On the other hand, IRTA2 has a broader 
pattern of expression that includes GC centrocytes, as 
well as a broad spectrum of perifollicular cells, which 



may include immunoblasts and memory cells. Initial data 
suggest that the pattern of expression of IRTA3 is 
analogous to IRTA2 , while IRTA4 and IRTA5 are selectively 
expressed in mantle zone B cells (I. Miller et al . , 
manuscript in preparation) , the pre-GC compartment of 
mature B cells (MacLennan, I. C, 1994). This topographic 
restriction of IRTA gene expression in lymphoid organs 
suggests that the IRTA molecules may play a role in the 
migration or activity of various B cell subpopulations in 
specific functional B cell compartments. In addition, 
IRTA expression should be useful for the differential 
diagnosis of NHL subtypes deriving from various B cell 
compartments, particularly IRTA1 in the diagnosis of 
marginal zone lymphoma. 

IRTA1 locus and lq21 abnormalities in MM 

In the FR4 cell line, the consequence of the t(l;14) 
translocation is the formation of an IRTAl/Ca ± fusion gene. 
Despite the fact that this gene is driven by the IRTA1 
promoter region, which is normally silent in plasma cells, 
its expression is high in FR4 , presumably due to the 
influence of the Ca ± 3' LCR, which is retained downstream 
of the Ca x locus. The fusion gene encodes a IRTAI/OXl 
fusion protein which contains only the signal peptide and 
first two amino acids of IRTA1 linked to the surface IgA 
receptor. The latter has been almost completely deprived 
of its extracellular domain, but retains all its 
transmembrane and intracellular domains. This structure 
indicates that the IRTAl/Caj. fusion protein, though 
probably unable to bind any ligand, may retain the 
potential for dimerization and signaling. In particular, 
the membrane (m) IgA-derived extracellular portion 
contains a cysteine residue, which can be involved in 
disulphide bonds between two a-chains or between a-chains 
and associated proteins, such as the auxilliary surface 
receptor CD19 (Leduc et al . , 1997). The fusion protein 
also carries the intact, 14 amino acid mlgA cytoplasmic 



domain, which is highly conserved in evolution (Reth, 
1992) and may play an essential role in the proliferation, 
survival and differentiation of mature B-cells, analogous 
to the role of mlgG and mlgE (Kaisho et al . , 1997) . Thus, 
the emergence of the IRTAl/Ca! protein in FR4 may have 
provided the cells with a proliferative and survival 
advantage during tumor development through ligand 
(antigen) -independent activation of the BCR pathway. This 
fusion event however, appears to be rare in B-cell 
malignancy, since so far we were able to detect it only in 
FR4 cells. 

IRTA2 locus and lq21 abnormalities in MM and BL 

Abnormal expression of IRTA2 is a frequent consequence of 
lq21 abnormalities. Although this gene is not expressed 
normally either in centroblasts , the presumed normal 
counterparts of BL (Kuppers et al . , 1999), or in BL with 
normal lq21, its levels are upregulated on average by 10- 
fold in BL cell lines with lq21 abnormalities. This 
deregulation appears to be specific for IRTA2 since all 
the other 4 IRTA genes present within 3 00 kb on lq21 are 
either not expressed in BL ( IRTA1) , or their pattern of 
expression does not correlate with the presence of lq21 
abnormalities {IRTA3, 4, 5, not shown) . The mechanism by 
which this deregulation occurs is difficult to ascertain 
in the absence of structural lesions within or adjacent to 
the IRTA2 gene. Since the heterogeneous aberrations that 
affect lq21 all cause an excess copy number of the IRTA 
locus, it is possible that this may lead to regulatory 
disturbances, as is the case for low level amplification 
of BCL2 in FL lacking (14; 18) translocations (Monni et 
al., 1997), REL in diffuse large cell lymphoma 
(Houldsworth et al . , 1996; Rao et al., 1998) and 
deregulation of Cyclin Dl in some MM cases with trisomy 11 
(Pruneri et al . , 2000). On the other hand, lq21 

abnormalities, including translocations and duplications, 
change the genomic context of the IRTA locus and may lead 
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to deregulation of IRTA2 by distant cis-acting enhancer 
chromatin organizing elements acting on its promoter as is 
the case for MYC in endemic BL (Pelicci et al . , 1986) and 
MM (Shou et al . , 2000) and for CCND1 in mantle cell 
lymphoma (Bosch et al . , 1994; Swerdlow et al . , 1995) and 
MM (Pruneri et al . , 2000). 

The biological consequences of deregulated IRTA2 
expression are difficult to predict at this stage. The 
observation that IRTA2 has homology with CAM adhesion 
receptors, together with its specific distribution in the 
light zone of the GC suggest that its ectopic expression 
in centroblasts may cause a disruption in the GC 
development and architecture. On the other hand, our 
initial observations that IRTA2 can bind IgG immune 
complexes comparably to bona fide Fc receptors suggest 
that its inappropriate expression may perturb the dynamics 
of cell surface regulation of B cell immunological 
responses, possibly leading to clonal expansion. 
Deregulated expression of FCGR2B as a result of the 
t (1; 14) (q21;q32) in follicular lymphoma has been proposed 
to contribute to lymphomagenesis in this tumor type 
(Callanan et al . , 2000), by a mechanism involving escape 
by tumor cells of anti-tumor immune surveillance through 
their Fc binding and inactivation of tumor specific IgG. 
Similar evasion mechanisms have been observed in cells 
infected by Fc-encoding herpesvisures (Dubin et al . , 
1991) . The role of IRTA2 deregulation needs to be tested 
in "gain of function" transgenic mice constitutively 
expressing IRTA2 in the GC. 
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Third Series of Experiments 

Chromosome lq21 is frequently altered by translocations 
and duplications in several types of B cell malignancy, 
including multiple myeloma, Burkitt lymphoma, marginal 
zone lymphomas, and follicular lymphoma. To identify the 
genes involved in these aberrations, cloned was the 
chromosomal breakpoint of a t (1 ; 14 ) (q21 ;q32 ) in the 
myeloma cell line FR4 . A 300kb region spanning the 
breakpoint contains at least five highly related adjacent 
genes which encode surface receptor molecules that are 
members of the immunoglobulin gene superfamily, and thus 
called IRTA (Immunoglobulin Receptor Translocation 



Associated) . The various IRTA molecules have from three to 
nine extracellular immunoglobulin superfamily domains and 
are related to the Fc gamma receptors . They have 

transmembrane and cytoplasmic domains containing ITIM-like 
and ITAM-like (ITRA-1, IRTA- 3, IRTA-4) signaling motifs. 
In situ hybridization experiments show that all IRTA genes 
are expresed in the B cell lineage with distinct 
developmental stage-specific patterns: IRTA-1 is expressed 
in a marginal B cell pattern. IRTA-2 is expressed in 
centrocytes and more mature B cells. As a result of the 
translocation in FR4 , IRTA-1 is broken and produces a 
fusion transcript with the immunoglobulin locus. The 
IRTA-2 gene, normally silent in centroblasts, is 
overexpressed in multiple myeloma and in Burkitt lymphoma 
cell lines carrying lq21 abnormalities. The data here 
suggests that IRTA genes are novel B cell regulatory 
molecules that may also have a role in lymphomagenesis . 
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1. An isolated nucleic acid molecule which encodes 
immunoglobulin receptor, Immunoglobulin superfamily 
Receptor Translocation Associated, IRTA, protein. 

2. The isolated nucleic acid molecule of claim 1, 
wherein the IRTA protein is IRTA1 protein comprising 
the amino acid sequence set forth in Figure 18A (SEQ 
ID N0:1) . 

3. The isolated nucleic acid molecule of claim 1, 
wherein the IRTA protein is IRTA2 protein comprising 
the amino acid sequence set forth in Figures 18B-1- 
18B-3 (SEQ ID NO:3) . 

4. The isolated nucleic acid molecule of claim 1, 
wherein the IRTA protein is IRTA3 protein comprising 
the amino acid sequence set forth in Figures 18C-1- 
18C-2 (SEQ ID N0:5) . 

5. The isolated nucleic acid molecule of claim 1, 
wherein the IRTA protein is IRTA4 protein comprising 
the amino acid sequence set forth in Figures 18D-1- 
18D-2 (SEQ ID NO: 7) . 

6. The isolated nucleic acid molecule of claim 1, 
wherein the IRTA protein is IRTA5 protein comprising 
the amino acid sequence set forth in Figures 18E-1- 
18E-2 (SEQ ID NO: 9) . 



An isolated nucleic acid molecule of claim 1, wherein 
the nucleic acid molecule is DNA. 

The isolated DNA molecule of claim 2, wherein the DNA 
is cDNA. 

The isolated DNA molecule of claim 2, wherein the DNA 
is genomic DNA. 

The isolated nucleic acid molecule of claim 1, 
wherein the nucleic acid molecule is an RNA molecule. 

The isolated DNA molecule of claim 2, wherein the DNA 
molecule is cDNA having the nucleotide sequence set 
forth in Figure 18A (SEQ ID NO:2). 

The isolated DNA molecule of claim 2, wherein the DNA 
molecule is cDNA having the nucleotide sequence set 
forth in Figure 18A (SEQ ID NO : 4 ) . 

The isolated DNA molecule of claim 2, wherein the DNA 
molecule is cDNA having the nucleotide sequence set 
forth in Figure 18A (SEQ ID NO : 6 ) . 

The isolated DNA molecule of claim 2, wherein the DNA 
molecule is cDNA having the nucleotide sequence set 
forth in Figure 18A (SEQ ID NO: 8) . 

The isolated DNA molecule of claim 2, wherein the DNA 
molecule is cDNA having the nucleotide sequence set 
forth in Figure 18A (SEQ ID NO: 10). 



The isolated nucleic acid molecule of claim 1, 
wherein the nucleic acid molecule encodes a human 
IRTA1 protein. 

The isolated nucleic acid molecule of claim 1 
operatively linked to a promoter of DNA 
transcription . 

The isolated nucleic acid molecule of claim 17, 
wherein the promoter comprises a bacterial, yeast, 
insect, plant or mammalian promoter. 

A vector comprising the nucleic acid molecule of 
claim 17. 

The vector of claim 19, wherein the vector is a 
plasmid. 

A host cell comprising the vector of claim 20. 

The host cell of claim 21, wherein the cell is 
selected from a group consisting of a bacterial cell, 
a plant cell, and insect cell and a mammalian cell. 

An isolated nucleic acid molecule comprising at least 
15 contiguous nucleotides capable of specifically 
hybridizing with a unique sequence included within 
the sequence of the isolated nucleic acid molecule 
encoding IRTA1 protein of claim 1. 

The isolated nucleic acid molecule of claim 23 
labeled with a detectable marker. 



The nucleic acid molecule of claim 24, wherein the 
detectable marker is selected from the group 
consisting of a radioactive isotope, enzyme, dye, 
biotin, a fluorescent label or a chemiluminescent 
label. 

A method for detecting a B cell malignancy or a type 
of B cell malignancy in a sample from a subject 
wherein the B cell malignancy comprises a lq21 
chromosomal rearrangement which comprises: 

a) obtaining RNA from the sample from the subject; 

b) contacting the RNA of step (a) with a nucleic 
acid molecule of at least 15 contiguous 
nucleotides capable of specifically hybridizing 
with a unique sequence included within the 
sequence of an isolated RNA encoding human IRTA 
protein selected from the group consisting of 
human IRTA1 , IRTA2 , IRTA3 , IRTA4 and IRTA5 , 
under conditions permitting hybridization of the 
RNA of step (a) with the nucleic acid molecule 
capable of specifically hybridizing with a 
unique sequence included within the sequence of 
an isolated RNA encoding human IRTA protein, 
wherein the nucleic acid molecule is labeled 
with a detectable marker; and 

c) detecting any hybridization in step (b) , wherein 
detecion of hybridization indicates presence of 
B cell malignancy or a type of B cell malignancy 
in the sample. 
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The method of claim 26, wherein the detectable marker 
is radioactive isotope, enzyme, dye, biotin, a 
fluorescent label or a chemiluminescent label. 

The method of claim 26, wherein the B cell malignancy 
is selected from the group consisting of B cell 
lymphoma, multiple myeloma, Burkitt ' s lymphoma, 
marginal zone lymphoma, diffuse large cell lymphoma 
and follicular lymphoma cells. 

The method of claim 28, wherein the B cell lymphoma 
is Mucosa-Associated-Lymphoid Tissue B cell lymphoma 
(MALT) . 

The method of claim 28, wherein the B cell lymphoma 
is non-Hodgkin 1 s lymphoma. 

An antisense oligonucleotide having a sequence 
capable of specifically hybridizing to an mRNA 
molecule encoding a human ITRA protein so as to 
prevent overexpression of the mRNA molecule. 

The antisense oligonucleotide of claim 31, wherein 
the ITRA protein selected from the group consisting 
of human IRTA1 , IRTA2 , IRTA3 , IRTA4 and IRTA5 
protein . 

A purified IRTA1 protein comprising the amino acid 
sequence set forth in Figure 18A (SEQ ID N0:1) . 
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The purified IRTA1 protein of claim 33, wherein the 
IRTA1 protein is human IRTA1 . 



A purified IRTA2 protein comprising the amino acid 
sequence set forth in Figures 18B-1-18B-3 (SEQ ID 
NO: 3) . 

The purified IRTA2 protein of claim 35, wherein the 
IRTA2 protein is human IRTA2 . 

A purified IRTA3 protein comprising the amino acid 
sequence set forth in Figures 18C-1-18C-2 (SEQ ID 
NO : 5 ) . 

The purified IRTA3 protein of claim 37, wherein the 
IRTA3 protein is human IRTA3 . 

A purified IRTA4 protein comprising the amino acid 
sequence set forth in Figures 18D-1-18D-2 (SEQ ID NO: 
7) . 

The purified IRTA3 protein of claim 39, wherein the 
IRTA4 protein is human IRTA4 . 

A purified IRTA5 protein comprising the amino acid 
sequence set forth in Figures 18E-1-18E-2 (SEQ ID NO: 
9) . 

The purified IRTA5 protein of claim 41, wherein the 
IRTA5 protein is human IRTA5 . 

An antibody directed to a purified IRTA protein 
selected from the group consisting of human IRTA1, 
IRTA2, IRTA3, IRTA4 and IRTA5 . 



44 . 
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The antibody of claim 43, wherein the IRTA protein is 
human IRTA protein. 



The antibody of claim 43, wherein the antibody is a 
monoclonal antibody or a polyoclonal antibody. 

The antibody of claim 43, wherein the monoclonal 
antibody is a murine monoclonal antibody or a 
humanized monoclonal antibody. 

The antibody of claim 43, wherein the antibody is 
conjugated to a therapeutic agent, wherein the 
therapeutic agent is selected from the group 
consisting of a radioisotope, a toxin, a toxoid, or a 
chemotherapeutic agent . 

A pharmaceutical composition comprising an amount of 
the antibody of claim 43 effective to bind to cancer 
cells expressing an IRTA protein selected from the 
group consisting of human IRTA1, IRTA2 , IRTA3 , IRTA4 
and IRTA5 so as to prevent growth of the cancer cells 
and a pharmaceut ically acceptable carrier. 

The pharmaceutical composition of claim 48, wherein 
the cancer cells are selected from the group 
consisting of B cell lymphoma, a mantle cell lymphoma 
multiple myeloma, Burkitt ' s lymphoma, marginal zone 
lymphoma, diffuse large cell lymphoma and follicular 
lymphoma cells. 



The pharmaceutical composition of claim 49, wherein 
the B cell lymphoma cells are Mucosa-Associated- 
Lymphoid Tissue B cell lymphoma (MALT) cells. 

The pharmaceutical composition of claim 49, wherein 
the B cell lymphoma cells are non-Hodgkin ' s lymphoma 
cells . 

A pharmaceutical composition comprising an amount of 
the oligonucleotide of claim 31 effective to prevent 
overexpression of a human IRTA protein and a 
pharmaceutically acceptable carrier. 

A method of diagnosing B cell malignancy which 
comprises a lq21 chromosomal rearrangement in a 
sample from a subject which comprises: 

a) obtaining the sample from the subject ,- 

b) contacting the sample of step (a) with the 
antibody of claim 43 capable of specifically 
binding with a human IRTA protein selected from 
the group consisting of human IRTA1, IRTA2 , 
IRTA3, IRTA4 and IRTA5 IRTA protein on a cell 
surface of a cancer cell under conditions 
permitting binding of the antibody with human 
IRTA protein on the cell surface of the cancer 
cell, wherein the antibody is labeled with a 
detectable marker; and 

c) detecting any binding in step (b) , wherein 
detecion of binding indicates a diagnosis of B 
cell malignancy in the sample. 



The method of claim 53, wherein the B cell malignancy 
is selected from the group consisting of B cell 
lymphoma, multiple myeloma, Burkitt ' s lymphoma, 
mantle cell lymphoma, marginal zone lymphoma, diffuse 
large cell lymphoma and follicular lymphoma. 

The method of claim 54, wherein the B cell lymphoma 
is Mucosa-Associated-Lymphoid Tissue B cell lymphoma 
(MALT) . 

The method of claim 54, wherein the B cell lymphoma 
is non-Hodgkin ' s lymphoma. 

A method of treating a subject having a B cell cancer 
which comprises administering to the subject an 
amount of anti-IRTA antibody effective to bind to 
cancer cells expressing an IRTA protein selected from 
the group consisting of human IRTA1, IRTA2 , IRTA3 , 
IRTA4 and IRTA5 so as to prevent growth of the cancer 
cells and a pharmaceutically acceptable carrier, 
thereby treating the subject. 

The method of claim 57, wherein the anti-IRTA 
antibody is a monoclonal antibody. 

The method of claim 58, wherein the monoclonal 
antibody is a murine monoclonal antibody or a 
humanized monoclonal antibody. 
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The method of claim 57, wherein the anti-IRTA 
antibody is a polyoclonal antibody. 



The method of claim 57, wherein the B cell cancer is 
selected from the group consisting of B cell 
lymphoma, multiple myeloma, mantle cell lymphoma, 
Burkitt ' s lymphoma, marginal zone lymphoma, diffuse 
large cell lymphoma and follicular lymphoma. 

The method of claim 61, wherein the B cell lymphoma 
is Mucosa-Associated-Lymphoid Tissue B cell lymphoma 
(MALT) . 

The method of claim 61, wherein the B cell lymphoma 
is non-Hodgkin 1 s lymphoma. 

A method of treating a subject having a B cell cancer 
which comprises administering to the subject an 
amount of the oligonucleotide of claim 31 effective 
to prevent overexpression of a human IRTA protein, so 
as to arrest cell growth or induce cell death of 
cancer cells expressing IRTA protein (s) and a 
pharmaceutically acceptable carrier, thereby treating 
the subj ect . 

The method of claim 64, wherein the IRTA protein is 
selected from the group consisting of human IRTA1, 
IRTA2 , IRTA3 , IRTA4 and IRTA 5 protein . 

The method of claim 64, wherein the B cell cancer is 
selected from the group consisting of B cell 
lymphoma, mantle cell lymphoma, multiple myeloma, 
Burkitt ' s lymphoma, marginal zone lymphoma, diffuse 
large cell lymphoma and follicular lymphoma. 



The method of claim 66, wherein the B cell lymphoma 
is Mucosa-Associated-Lymphoid Tissue B cell lymphoma 
(MALT) . 

The method of claim 66, wherein the B cell lymphoma 
is non-Hodgkin ' s lymphoma. 
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ISOLATION OF FIVE NOVEL GENES CODING FOR NEW Fc RECEPTORS - 
TYPE MELANOMA INVOLVED IN THE PATHOGENESIS OF 
LYMPHOMA/MELANOMA 

5 Abstract of the Disclosure 

This invention provides an isolated nucleic acid molecule 
which encodes immunoglobulin receptor, Immunoglobulin 
superf amily Receptor Translocation Associated, IRTA, 
10 protein. Provided too, are the IRTA proteins encoded by 

the isolated nucleic acid molecules, IRTA1, IRTA2 , IRTA3 , 
; ==, ; IRTA4 or IRTA5 proteins, having the amino acid sequences 

;^ set forth in any of Figures 18A, 18B-1-18B-3, 18C-1-18C-2, 

HI 18D-1-18D-2 or 18E-1-18E-2. Oligonucleotides of the 

lMi isolated nucleic acid molecules are provided. Antibodies 

' ; :5 directed to an epitope of a purified IRTA1 , IRTA2, IRTA3 , 

IRTA4 or IRTA5 proteins are also provided, as are 
y„ pharmaceutical compositions comprising such antibodies or 

■jf oligonucleotides. Methods for detecting a B cell 

2 0 malignancy in a sample from a subject; diagnosing B cell 

malignancy in a sample from a subject; detecting human 
IRTA protein in a sample; and treating a subject having a B 
cell cancer are also provided. 
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FIGURE 5 
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CTCAATCAGCTTTATGCAGAGAAGAAGCTTACTGAGCTCACTGCTGGTGCTGGTGTAGGCAAGTGCTGCTTTGGCAA 

M L L W A S 

TCTGGGCTGACCTGGCTTGTCTCCTCAGAACTCCTTCTCCAACCCTGGAGCAGGCTTCCATGCTGCTGTGGGCGTCC 
LLAFAPVC GW QSAAAHKPVISVHPPWT 32 
TTGCTGGCCTTTGCTCCAGTCTGTGGACAATCTGCAGCTGCACACAAACCTGTGATTTCCGTCCATCCTCCATGGAC 

TFFKGERVT1TCNGFQFYATEKTTWY58 
CACATTCTTCAAAGGAGAGAGAGTGACTCrGACTTGCAATGGATTTCAGTTCTATGCAACAGAGAAAACAACATGGT 

HRHYWGEKLTLTPGNTLEVRESGLY 83 
ATCATCGGCACTACTGGGGAGAAAAGTTGACCCTGACCCCAGGAAACACCCTCGAGGTTCGGGAATCTGGACTGTAC 
RCQARGSPRSNPVRLliFSSDSLILQA 109 
AGATGCCAGGCCCGGGGCTCCCCACGAAGTAACCCTGTGCGCTTGCTCTTTTCTTCAGACTCCTTAATCCTGCAGGC 

PYSVFEG DTLVLRCHRRRKEKLTAVK 135 

YTWNGNILSISNKSWDLLI PQASS 160 
AATATACTTGGAATGGAAACATTCTTTCCATTTCTAATAAAAGCTGGGATCTTCTTATCCCACAAGCAAGTTCAAAT 
NNGNYRCIGYGDENDVFRSNFKIIKI186 
AACAATGGCAATTATCGATGCATTGGATATGGAGATGAGAATGATGTATTTAGATCAAATTTCAAAATAATTAAAAT 

QELFPHPELKATDSQPTEGNSVNLSC 212 
TCAAGAACTATTTCCACATCCAGAGCTGAAAGCTACAGACTCTCAGCCTACAGAGGGGAATTCTGTAAACCTGAGCT 

ETQLPPERSDTPLHFNFFRDGEVIL 237 
GTGAAACACAGCTTCCTCCAGAGCGGTCAGACACCCCACTTCACTTCAACTTCTTCAGAGATGGCGAGGTCATCCTG 
SDWSTYPELQLPTVWRSNSGSYWCGA263 
TCAGACTGGAGCACGTACCCGGAACTCCAGCTCCCAACCGTCTGGAGAGAAAACTCAGGATCCTATTGGTGTGGTGC 

ETVRGNIHKHSPSLQIHVQRIPVSGV 289 
TGAAACAGTGAGGGGTAACATCCACAAGCACAGTCCCTCGCTACAGATCCATGTGCAGCGGATCCCTGTGTCTGGGG 

LLETQPSGGQAVEGEMLVLVCSVAE 314 
TGCTCCTGGAGACCCAGCCCTCAGGGGGCCAGGCTGTTGAAGGGGAGATGCTGGTCCTTGTCTGCTCCGTGGCTGAA 
GTGDTTFSWHREDMQESLGRKTQRSL 340 

RAELELPAIRQSHAGGYYCTADNSYG 366 
GAGAGCAGAGCTGGAGCTCCCTGCCATCAGACAGAGCCATGCAGGGGGATACTACTGTACAGCAGACAACAGCTACG 

PVQSMVLN. VTVRETPGNRDGLVAAG 391 
GCCCTGTCCAGAGCATGGTGCTGAATGTCACTGTGAGAGAGACCCCAGGCAACAGAGATGGCCTTGTCGCCGCGGGA 

A T S Q LLSAL*. r , AVA LLFH C W R R R K S G 417 

GCCACTGGAGGGCTGCTCAGTGCTCTTCTCCTGGCTGTGGCCCTGCTGTTTCACTGCTGGCGTCGGAGGAAGTCAGG 

VGFLGDETRLPPAPGPGES SHSICPA443 
AGTTGGTTTCTTGGGAGACGAAACCAGGCTCCCTCCCGCTCCAGGCCCAGGAGAGTCCTCCCATTCCATCTGCCCTG 

Q V E L Q S L Y V D V HPKKGDLV Y S E I Q T 468 
CCCAGGTGGAGCTTCAGTCGTTGTATGTTGATGTACACCCCAAAAAGGGAGATTTGGTATACTCTGAGATCCAGACT 

TQLGEEEEANTSRTLL EDKDVSVV Y S 494 

ACTCAGCTGGGAGAAGAAGAGGAAGCTAATACCTCCAGGACACTTCTAGAGGATAAGGATGTCTCAGTTGTCTACTC 

g V KTQHPDNSAGKISSKDEES* 515 

TGAGGTAAAGACACAACACCCAGATAACTCAGCTGGAAAGATCAGCTCTAAGGATGAAGAAAGTTAAGAGAATGAAA 
AGTTACGGGAACGTCCTACTCATGTGATTTCTCCCTTGTCCAAAGTCCCAGGCCCAGTGCAGTCCTTGCGGCACCTG 
GAATGATCAACTCATTCCAGCTTTCTAATTCTTCTCATGCATATGCATTCACTCCCAGGAATACTCATTCGTCTACT 
CTGATGTTGGGATGGAATGGCCTCTGAAAGACTTCACTAAAATGACCAGGATCCACAGTTAAGAGAAGACCCTGTAG 
TATTTGCTGTGGGCCTGACCTAATGCATTCCCTAGGGTCTGCTTTAGAGAAGGGGGATAAAGAGAGAGAAGGACTGT 
TATGAAAAACAGAAGCACAAATTTTGGTGAATTGGGATTTGCAGAGATGAAAAAGACTGGGTGACCTGGATCTCTGC 
TTAATACATCTACAACCATTGTCTCACTGGAGACTCACTTGCATCAGTTTGTTTAACTGTGAGTGGCTGCACAGGCA 
CTGTGCAAACAATGAAAAGCCCCTTCACTTCTGCCTGCACAGCTTACACTGTCAGGATTCAGTTGCAGATTAAAGAA 
CCCATCTGGAATGGTTTACAGAGAGAGGAATTTAAAAGAGGACATCAGAAGAGCTGGAGATGCAAGCTCTAGGCTGC 
GCTTCCAAAAGCAAATGATAATTATGTTAATGTCATTAGTGACAAAGATTTGCAACATTAGAGAAAAGAGACACAAA 
TATAAAATTAAAAACTTAAGTACCAACTCTCCAAAACTAAATTTGAACTTAAAATATTAGTATAAACTCAT AATAAA 
CTCTGCCTTTAAATAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 6a 



1 CGGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTTGGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTG 

ML^WVILLVLAPVSGyQFART?R22 
31 GTGGTCCAGGTCTTCATGCTGCTGTGGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAG 

PI IFLQPPWTTVFQGERVTLTCKGFRF49 
161 GCCCATTATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAAGGGATTTCGCT 

YSPQKTKWYHRYLGKEIL.RETPDNIL 75 
241 TCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAATACTAAGAGAAACCCCAGACAATATCCTT 

EVQESGEYRCQAQGSPLSSPVHLDFSS 102 
321 GAGGTTCAGGAATCTGGAGAGTACAGATGCCAGGCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTC 

ASLILQAPLSVFEGDSVVLRCRAKAE V129 
401 AGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGACTCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAG 

TLNNTIYKNDNVLAFLNKRTDFHI ? K 155 
431 TAACACTGAATAATACTATTTACAAGAATGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCATATTCCTCAT 

ACLKDNGAYRCTGYKESCCPVSSNTVK 182 
561 GCATGTCTCAAGGACAATGGTGCATATCGCTGTACTGGATAT AAGGAAAGTTGTTGCCCTGTTTCTTCCAATACAGTCAA 

IQVQEPFTRPVLRAS SFQPISGNPVT L209 
641 AATCCAAGTCCAAGAGCCATTTACACGTCCAGTGCTGAGAGCC AGCTCCTTCCAGCCCATCAGCGGGAACCCAGTGACCC 

TCETQLSLERSDVPLRFRFFRDDQTL 235 
2 1 TGACCTGTGAGACCCAGCTCTCTCTAGAGAGGTCAGATGTCCCGCTCCGGTTCCGCTTCTTCAGAGATGACCAGACCCTG 

GLGWSLSPNFQITAMWSKDSGFYWCKA 262 
= 01 GGATTAGGCTGGAGTCTCTCCCCGAATTTCCAGATTACTGCCATGTGGAGTAAAG ATTCAGGGTTCTACTGGTGTAAGGC 

ATM PH S V I S D S ? R S W I QVQ I PA S H P V L289 
3 31 AGCAACAATGCCTCACAGCGTCATATCTGACAGCCCGAGATCCTGGATACAGGTGCAGATCCCTGCATCTCATCCTGTCC 

'TLSPEKALNFEGTKVT1.HCETQEDSL315 
9 61 TCACTCTCAGCCCTGAAAAGGCTCTGAATTTTGAGGG AACCAAGGTGACACTTCACTGTGAAACCCAGGAAGATTCTCTG 

RTLYRFYHEGVPLRHKSVRC ERGAS I S 342 
1041 CGCACTTTGTACAGGTTTTATCATGAGGGTGTCCCCCTGAGGCACAAGTCAGTCCGCTGTGAAAGGGGAGCATCCATCAG 

FSLTTENSGNYYCTADNGLGAKPSKA V369 
1 12 1 CTTCTCACTGACTACAGAGAATTCAGGG AACTACTACTGCACAGCTGACAATGGCCTTGGCGCCAAGCCCAGTAAGGCTG 

SLSVTVPVSHPVLNLSSPEDLIFEGA 395 
1201 TGAGCCTCTCAGTCACTGTTCCCGTGTCTCATCCTGTCCTCAACCTCAGCTCTCCTGAGGACCTGATTTTTGAGGGAGCC 

KVTLHCEAQRGSL P I LYQFHHEDAALE 422 
1281 AAGGTGACACTTCACTGTGAAGCCCAGAGAGGTTCACTCCCCATCCTGTACCAGTTTCATCATGAGGATGCTGCCCTGGA 

RRSANSAGGVAISFSLTAEHSGNYYC T449 
1361 GCGTAGGTCGGCCAACTCTGCAGGAGGAGTGGCCATCAGCTTCTCTCTGACTGCAGAGCATTCAGGGAACTACTACTGCA 

ADNGFGPQRSKAVSLSITVPVSHPVL475 
1441 CAGCTGACAATGGCTTTGGCCCCCAGCGCAGTAAGGCGGTGAGCCTCTCCATCACTGTCCCTGTGTCTCATCCTGTCCTC 

TLSSAEALTFEGATVTLHCEVQRGSPQ502 
^21 ACCCTCAGCTCTGCTGAGGCCCTGACTTTTGAAGGAGCCACTGTGACACTTCACTGTGAAGTCCAGAGAGGTTCCCCACA 

ILYQFYHEDMPLWSSSTPSVGRVSFS F529 
• 6 0 1 AATCCT ATACCAGTTTTATCATGAGGACATGCCCCTGTGGAGCAGCTCAACACCCTCTGTGGGAAGAGTGTCCTTCAGCT 

SLTEGHSGNYYCTADNGFGPQRSSVV 555 
- 6 S i TCTCTCTGACTGAAGGACATTCAGGGAATTACT ACTGCACAGCTGACAATGGCTTTGGTCCCCAGCGCAGTGAAGTGG7G 

SLFVTVPVSRPILTLRVPRAQAVVGDl 5S2 
" " 61 AGCCTT^TTGTCACTGTTCCAGTGTCTCGCCCCATCCTCACCCrCAGGGTTCCCAGGGCCCAGGCTGTGGTGGGGGACCT 

L2LHCEAPRGSPPILYWFYHEDVTLG S609 
" 34 1 GCTGGAGCTTCACTGTGAGGCCCCGAGAGGCTCTCCCCCAATCCTGTACTGGTTTTATCATGAGGATGTCACCCTGGGGA 

S SAPSGGEAS ?NL S 1 TAEHSGNY S C E 635 
" 9 2 1 GCAGCTCAGCCCCCTCTGG AGGAGAAGCTTCTTTCAACCTCTCTCTG ACTGC AG AACATTCTGG AAACT ACTCATGTGAG 

ANNGLVAQHSDTI SLSVIVPVSRPI LT662 
2 001 GCCAACAATGGCCTAGTGGCCCAGCACAGTGACACAATATCACTCAGTGTTATAGTTCCAGTATCTCGTCCCATCCTCAC 

FRAPRAQAVVGDLLELHCEALRGSS P 1689 
2 081 CTTCAGGGCTCCCAGGGCCCAGGCTGTGGTGGGGGACCTGCTGGAGC7TCACTGTGAGGCCCTGAGAGGCTCCTCCCCAA 

LYWFYHEDVTLGKI SAP S_ GGG A S FNL 715 
2161 TCCTGTACTGGTTTTATCATGAAGATGTCACCCTGGGTAAGATCTCAGCCCCCTCTGGAGGAGGGGCCTCCTTCAACCTC 

SLTTEHSGIYSCEADNGLEAQRSEMVT 742 
2241 TCTCTGACTACAGAACATTCTGGAATCTACTCCTGTGAGGCAGACAATGGTCTGGAGGCCCAGCGCAGTG AGATGGTGAC 

LKVAGEWALPTSSTSEN* 759 
■> 3 2 1 ACTGAAAGTTGCAGGTGAGTGGGCCCTGCCCACCAGCAGCACATCTGAGAACTGACTGTGCCTGTTCTCCCTGCAGCTGA 
? 4 0 1 AAATGGAGCCACAGAGCTCCTCAGGGCTGTTTGCTTGTGTGGCATCCCAGCACACTTCCTGCCTGCAGAACCTCCCTGTG 
2 481 AAAGTCTCGGATCCTTTGTGGTATGGTTCCAGGAATCTG ATGTTTCCCAGCAGTCTTCTTG AAGATG ATCAAAGC ACCTC 
2 561 ACTAAAAATGCAAATAAG ACTTTTTTAGAAC AT AAACTATATTCTGAACTG AAATTATTAC ATGAAAATG AAACCAAAG A 
2 641 ATTCTGAGCATATGTTTCTCTGCCGTAGAAAGGATTAAGCTGTTTCTTGTCCGGATTCTTCTCTCATTGACTTCTAAGAA 
2721 GCCTCTACTCTTGAGTCTCTTTCATTACTGGGGATGTAAATGTTCCTTACATTTCCACATTAAAAATCCTATGTTAACGA 
AAAAA 



8/34 
FIGURE 6b 



1 CGGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACC7G7777GGAATTGAGGAAACTT£7C7TTTGATCTCAGCCCTTG 

MLLWVIL^VLAPVS G TQ F A R T ? R 22 
SI C-TGGTCCAGGTCTTCATGCTGCTGTGGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAG 

PIIFLQPPWTTVFQGERVTLTCKGFRF49 
161 GCCCATTATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAAGGGATTTCGCT 

YSPQKTKWYHRYLGKEILRETPDNIL 75 

2 41 7C7ACTCACCACAGAAAACAAAATGGTACCATCGG7ACC7TGGGAAAGAAATACTAAGAGAAACCCCAGACAATATCC7T 

EVQESC-EYRCQAQGSPLSSPVHLDFSS 102 
321 C-AGGTTCAGGAATCTGGAGAGTACAGATGCCAGGCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTC 

ASLILQAPLSVFEGDSVVLRCRAKAEV 129 
4 01 AGCTTCGCTGATCCTGCAAGCTCC ACTTTCTGTGTTTGAAGGAGACTCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAG 

TLNNTIYKNDNVLAFLNKR7DFHIPH 155 
481 TAACACTGAATAATACTATTTACAAGAATGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCATATTCCTCAT 

ACLKDNGAYRCTGYKESCCPVSSNTVK 182 
561 GCATGTCTCAAGGACAATGGTGCATATCGCTGTACTGGATATAAGGAAAGTTGTTGCCCTGTTTCTTCCAATACAGTCAA 

IQVQEPFTRPVLRASSFQPISGNPVTL 209 
641 AATCCAAGTCCAAGAGCCATTTACACGTCCAGTGCTGAGAGCCAGCTCCTTCCAGCCCATCAGCGGGAACCCAGTGACCC 

7CETQLSLERSD7PLRFRFFRDDQTL 235 

7 21 TGACCTGTGAGACCCAGCTCTCTCTAGAGAGGTCAGATGTCCCGCTCCGGTTCCGCTTCTTCAGAGATGACCAGACCCTG 

GLGWSLSPNFQITAMWSKDSGFYWCKA 262 

8 01 GGATTAGGCTGGAGTCTCTCCCCGAATTTCCAGATTACTGCCATGTGGAGTAAAGATTCAGGGTTCTACTGGTGTAAGGC 

ATMPHSVISDSPRSWIQVQIPASHPVL 289 

3 81 AGCAACAATGCCTCACAGCGTCATATCTGACAGCCCG AGATCCTGGATACAGGTGCAGATCCCTGCATCTCATCCTGTCC 

7LSPEKALNFEG7KVTLHCZTQEDSL 315 

9 61 r CACTCTCAGCCCTGAAAAGGCTCTGAATTTTGAGGGAACCAAGGTGACACTTCACTGTGAAACCCAGGAAGATTCTCTG 

RTLYRFYHEGVPLRHKSVRCERGASIS 342 
1041 CGCACTTTGTACAGGTTTTATCATGAGGGTGTCCCCCTGAGGCACAAGTCAGTCCGCTGTGAAAGGGGAGCATCCATCAG 

FSLTTENSGNYYC7ADNGLGAKFSKAV369 
1121 CTTCTCACTGACTACAGAGAATTCAGGGAACTACTAC7GCACAGCTGACAATGGCCTTGGCGCCAAGCCCAGTAAGGCTG 

SLSV7VPVSHPVLNLSSPEDLIFEGA 395 
1201 7GAGCCTCTCAGTCAC7GTTCCCGTGTCTCATCCTG7CCTCAACCTCAGCTCTCCTGAGGACCTGATTTTTGAGGGAGCC 

KVTLHCEAQRGS — PILYQFHHEDAALE 422 
1281 AAGGTGACACTTCACTGTGAAGCCCAGAGAGGTTCAC7CCCCATCCTGTACCAGTTTCATCATGAGGATGCTGCCCTGGA 

RRSANSAGGVAISFSLTAEHSGNYYCT 449 
1361 GCGTAGGTCGGCCAACTCTGCAGGAGGAGTGGCCATCAGCT7CTCTCTGACTGCAGAGCATTCAGGGAACTACTACTGCA 

ADNGFGPQRSKAVSLSITVPVSHPVL 475 
1441 CAGCTGACAATGGC7TTGGCCCCCAGCGCAGTAAGGCGGTGAGCCTCTCCATCACTGTCCCTGTGTCTCATCCTGTCCTC 

7LSSAEALTFEGATVTLHCEVQRGSPQ 502 
1521 ACCCTCAGCTCTGC7GAGGCCC7GACTT7TG AAGGAGCCACTGTGACACTTCACTGTGAAGTCCAGAGAGGTTCCCCACA 

I L Y Q F Y H E D M P L :•; S S S T P S V G R V S F S F 529 
1601 AATCCTATACCAGTT7TATCATGAGGACATGCCCC7G7GGAGCAGC7CAACACCCTCTG7GGGAAGAGTGTCCTTCAGCT 

SL7EGHSGNYYC7ADNGFGPQRSEVV 555 
1681 7C7CTCTGACTGAAGGACATTCAGGGAATTACTAC7GCACAGC7GACAATGGCTTTGGTCCCCAGCGCAGTGAAGTGGTG 

SLFVTGKCWVLASKFPLAEFSLTHSFK 582 
17 61 AGCCTTTTTGTCAC7GGTAAGTGCTGGGT7C7TGCCAG7CACCCACCCCTGGCTGAGT7C7CTCTCACCCATTCCTTTAA 

NLFALSSFL? * stop 592 
1341 .iAATCTGTTTGCAC7GTCCAGTTTCCTCCCC7AATCA-.C7TAATCCCCTTCTTGGCT7CC7CCTCAACTAACTAGCTGGG 
1 S2 1 C-77T7CCGTACTCA7AAGTCC7GGCTCAGCCAGACCCC7AAAACAGC7CAGTAGAT7CCCCAGCTT77ACCAAATGAATT 
2001 7ATTTATTGTATT77CTCCTCATTCC77GTATGT7CCAACAG7ACGCCAATTTTTCT7GATGCACGGAGCGTGTCCTACT 
2081 7CTCTACTGACATTTACATATTAACTTAGCTACAAGCACAGTCT7ATAGATAAATATTGGTCAAGACCTTAAATTCTCCA 
2161 AAGGATTTCCAATCTTATGGTAGATTTGGAGAAAGC7GC7GGTGAACAAAGGGGGAAATGGCTCCCTAGGAACCAAC7CC 
2241 7CAAACTTCTGGAGTTTTTATGATCCCTTGTTTTC7AACCTGCTAAAATCAGTATCATTTTATTGTATTATTTTAAAAAA 
2321 ACTATTGTTGAAG7ATGACATACATTCAAGAAACGTG7GCAAATTGTATGTGTACGATTTGGTGTCTTTTTAGGAGCTAA 
2401 G7TGCTTCTGTTTT7ACTTGAATCTTTGT7TATAGAAACTGGGGGAAAGTTTACT7TCTTTTCAGAGAAGCCAAATGGTA 
2481 7GATAGAAAAATC77GAGCCTGATGTGTCAGACATGCCCCTAGCATAACTTGTTG AGTAAAGAGGTTATTTTTAAAATGT 
2561 GAATGTTCTGAGACTACTCCAAAGTCAGAGCCAAATC7ACTAGGAAGCTTC7AGACTTCACTCATTCTGCATCCCATTAC 
2641 7ATCTTTTTATCCATGTTTTACTTTCTTCTCATAT7CAGCAGCATCTTAAGCCTCTTTATTTTCTGTTTCTTGACTGTCA 
2721 CCCTTAATGCCAGTAGAATGTAAGCTTCATGAGAACAGAACTGCATCCATCTTGGTCT7CACAACATCCCTGTGCCTACT 
2801 CAGTGTTTGGCACACAGTAGGTCCTCAGTCAACAT77G7AATTTAGTGGACAGATGATATGACAAGATGATAAGAGGGGA 
2881 7TTAAAAAAATCATCTAGCAAAGCCCAAGAGGAAAAAAAACAAAGCTATTTTAGAAATGAAATACCAATTTGAAGCAGTA 
2961 AGAATAGATTGGA7ATCTTTGAAAACCATTAATTGAA7GAAGAACCAATTTGAGAAAACAATACAGAATGCAAAGTAGAA 
3041 AGATACAGAAATAAAGGCAAAAGTTATAATATGGAAA7CAGACAATGGATTTGTCTGTATCCAGTTATGTGGATAATTAA 
3 12 1 AATGGAGACCCTCAGAAAATTGAACCGAAGAGTAAAA7GAAACTCAAAAATGTAGTAGAAATTGTTGGGAAGTAAAGAAA 
3201 ACTTGAATATGTAGATCAGAACATATATGTTGATGACGTTATTGACTTTGAGGTTAAAAATATATATATGTGCCTATGAT 
3281 7ATGGGGAAAAAAGCAGTCGTCTCAGAAAGAAAAACA7CAAGTTAGTCTTAGACTTTGCAGTGCACTCAGTACCAAAGAG 
33 61 AGAGGAGGCCAGACTTGGACCTGCGAGGGAAGAATAATAACCGAAAATTTTATATCAATTCAAAAAGACATTGTCAAAAA 
3441 7ACAGGGATTCAGGAAACTGAGAATGCACTAAGCC77C7GGAAAAAACACCTAATGACAAAATCTAGCCCAACAAGATGT 
3 521 AAATGAATATAAAGGACTCATAATGAGGAAACCGCA77ATGACTGGCTCTCAACCCTGGCCGCATATTAGACTCGTCAAA 
3 601 GACCTTTGTAAAAGGTCACACATTGACTCGTCAAAGCCCCTCTCCAGACTAATTCAATTCAGAATCTCACAGATGGGGCC 
3 681 ACAGAATCAGTAT7TTTTGACACAACCTCAAGTGAGAATATTGTGTAGACAAGATTGGAAACCACTGATTTAGATATAGA 
37 61 AACAAAGGCTAATCAACTGTGAGAATTATGGTCACAGAATAGAAAGTAACTATTATGAACACTGAAAATGTAAAAAAAAT 
3 841 C-TAACAAAGAAAAATAGTTAG AGGAAGG AGAGGAAG7AAAGGAACAATCATTTTCTCATGATTATTATTATTTCAGAGTA 

3 921 AATTGTGAGTTA77TCACAATTCAAAAAGAATGGAC7GTTTTAAAAAATTAGTAATAGATTTCAAAATGTCCATTTTGTA 

4 001 AATCGTTTCTGAATACTTTGTCAACAGTTACTCATCATTA^TGGC77ATAC77CAC7AAAATTCCATGGAAAACCAACTA 
4081 GTAGCC7GTAGAGTCACATAGGAGAGAACAAGTGAAT7CTTTGGG7GGCGCAAGCATAGATG77AGGAC7GACAAAAAAA 
4161 AATAATAAAAATAAACCTGTGCA77C-ATATGATCACAAATGATCAGGGAAAGAGGAAACAGAAACTCTCATACGCCATTA 
4241 77ACAAG7GTAAATTGGTTCAACC777TCGTCTTAATTGACACA77GTAAT7GTATATATTTATGGAAGCACAGTT7GAT 
4321 ATTTTGATATACATACATGGTATA7AACGATCAAATTAGGATATTTAATGTACCCATCATCTCATGCATT7ATCATTTCT 
4401 77GG AATAAA AACATTCAAA AGCCAAAAAA AAAAAAAAAA AAAAAAAA 
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FIGURE 6c- 1 

i CGGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTTGGAATTGAGGAAACT^TCTTTTGATCTCAGCCCTTG 

M L L W V I 1. L. V L A ? V S GTQ F A R T ? R 22 
SI GTGGTCCAGGTCTTCATOCTGCTGTGGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAG 

PIIFLQPPWTTVFQGERVTLTCKGFRF49 
161 GCCCATTATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAAGGGATTTCGCT 

YSPQKTKWYHRYjGKEILRETPDNIL 75 
241 TCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAATACTAAGAGAAACCCCAGACAATA'rCCTT 

EVQESGEYRCQAQGSPLSSPVHLDFSS 102 
221 GAGGTTCAGGAATCTGGAGAGTACAGATGCCAGGCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTC 

ASLILQAPLSVFEGDSVVLRCRAKAEV129 
401 AGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGACTCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAG 

TLNNTIYKNDNVLAFLNKRTDFHIPH 155 
481 TAACACTGAATAATACTATTTACAAGAATGATAATGTCCTGGCATTCC7TAATAAAAGAACTGACTTCCATATTCCTCAT 

ACLKDNGAYRCTGYKESCCPVSSNTVK 182 
561 GCATGTCTCAAGGACAATGGTGCATATCGCTGTACTGGATATAAGGAAAGTTGTTGCCCTG7TTCTTCCAATACAGTCAA 

IQVQEPFTRPVLRASSFQPISGNPVTL 209 
641 AATCCAAGTCCAAGAGCCATTTACACGTCCAGTGCTGAGAGCCAGCTCCTTCCAGCCCATCAGCGGGAACCCAGTGACCC 

TCETQLSLERSDVPLRFRFFRDDQTL 235 
7 21 TGACCTGTGAGACCCAGCTCTCTCTAGAGAGGTCAGATGTCCCGCTCCGGTTCCGCTTCTTCAGAGATGACCAGACCCTG 

GLGWSLSPNFQITAMWSXDSGFYWCKA 262 
801 GGATTAGGCTGGAGTCTCTCCCCGAATTTCCAGATTACTGCCATGTGG AGTAAAGATTCAGGGTTCTACTGGTGTAAGGC 

ATMPHSVI SDS ? R S W I QVQI PASHPVL289 
S 8 1 AGCAACAATGCCTCACAGCGTCATATCTGACAGCCCGAGATCCTGGATACAGGTGCAGATCCCTGCATCTCATCCTGTCC 

TLSPEKALNFEGTKV7LHCSTQEDSL 315 
9 61 7CAC7C7CAGCCC7GAAAAGGC7C7GAA7777GAGGGAACCAAGG7GACAC77CAC7G7GAAACCCAGGAAGA77C7C7G 

R7LYRFYHEGYPLRHKSVRCERGASIS 342 
1041 CGCACTTTGTACAGGTTTTATCATGAGGGTGTCCCCCTGAGGCACAAGTCAGTCCGCTGTGAAAGGGGAGCATCCATCAG 

FSL77ENSGNYYC7ADNGLGAKPSKAV3 69 
1121 CTTCTCACTGACTACAGAGAATTCAGGGAACTACTACTGCACAGCTGACAATGGCCTTGGCGCCAAGCCCAGTAAGGCTG 

SLSVTVPVSHPVLHLSSPEDLIFEGA 395 
1201 TGAGCCTCTCAGTCACTGTTCCCGTGTCTCATCCTGTCCTCAACCTCAGCTCTCCTGAGGACCTGATTTTTGAGGGAGCC 

KVTLHCEAQRGSLPILYQFHHEDAALE 422 

12 81 AAGGTGACACTTCACTGTGAAGCCCAGAGAGGTTCACTCCCCATCCTGTACCAGTTTCATCATGAGGATGCTGCCCTGGA 

RRSANSAGGVAXSFSLTAEHSGNYYCT 449 

13 61 GCG7AGG7CGGCCAAC7C7GCAGGAGGAG7GGCCA7CAGC77C7C7C7GAC7GCAGAGCA77CAGGGAAC7AC7AC7GCA 

ADNGFGPQRSKAVSLSI7VPVSHPVL 475 
1441 CAGCTGACAATGGCTTTGGCCCCCAGCGCAGTAAGGCGGTGAGCCTCTCCATCACTGTCCCTGTGTCTCATCCTGTCCTC 

TLSSAEALTFEGATVTLHCEVQRGSPQ 502 
1521 ACCCTCAGCTCTGCTGAGGCCCTGACTTTTGAAGGAGCCACTGTGACACTTCACTGTGAAGTCCAGAGAGGTTCCCCACA 

I L Y Q F Y H E D M ? L :v S S S T P S V G R V S F S F 529 
1601 AATCCTATACCAGTTTTATCATGAGGACATGCCCGTGTGGAGCAGCTCAACACCCTCTGTGGGAAGAGTGTCCTTCAGCT 

SLTEGKSGNYYCTADNGFGPQRSEVV 555 
1681 7C7C7C7GAC7GAAGGACA77CAGGGAA77AC7AC7GCACAGC7GACAA7GGC777GG7CCCCAGCGCAG7GAAG7GG7G 

SLFVTVPVSRP ILTLRVPRAQAVVGDL 582 
1"61 AGCC77777G7CAC7G77CCAG7G7C7CGCCCCA7CC7CACCC7CAGGG77CCCAGGGCCCAGGC7G7GG7GGGGGACC7 

1 E L H C E A P R G 3 ? ? I L Y V; F Y H E D V 7 1, G S 609 
1 S 4 1 GCTGGAGCTTCACTGTGAGGCCCCGAGAGGCTCTCCCCCAATCCTGT ACTGGTTTTATCArG AGGATGTCACCCTGGGGA 

SSAPSGGEASF^LSLTAEHSGIIYSCE 635 
1921 GCAGC7CAGCCCCC7C7GGAGGAGAAGC77C777CAACC7C7C7C7GAC7GC AGAACA77 CTGGAAACTACTCATGTGAG 

A N N G L V A Q H S D T I S L S V I V P V S R P Z L T 662 
2001 GCCAACAATGGCCTAGTGGCCCAGCACAGTGACACAATATCACTCAGTGTTATAGTTCCAGT ATCTCGTCCCATCCTCAC 

FRAPRAQAVVGOLLE — HCEALRGSSPI 689 
2081 CTTCAGGGCTCCCAGGGCCCAGGCTGTGGTGGGGGACCTGCTGGAGCTTCACTGTGAGGCCCTGAGAGGCTCCTCCCCAA 

LYWFYHEDV7LGKI SAPSGGGASFNL 715 
2151 7CC7G7AC7GG7777A7CA7GAAGA7G7CACCC7GGG7AAGA7C7CAGCCCCC7C7GGAGGAGGGGCC7CC77CAACC7C 

SLTTEHSGIYSCEADN'GLEAQRSEMVT 742 
2241 7CTCTGAC7ACAGAACA77C7GGAA7C7AC7CC7G7GAGGCAGACAA7GG7C7GGAGGCCCAGCGCAG7GAGA7GG7GAC 

L K V A V P V S R P V 1 T L R A ? G T K A A V G D L L 769 
2321 ACTGAAAGTTGCAGTTCCGGTGTCTCGCCCGGTCCTCACCCTCAGGGCTCCCGGGACCCATGCTGCGGTGGGGGACCTGC 

ELHCEALRGSPLILYRFFHEDVTLGN 795 
2401 7GGAGCT7CAC7G7GAGGCCC7GAGAGGC7C7CCCC7GA7CC7G7ACCGG777777CA7GAGGA7G7CACCC7AGGAAAT 

RSSPSGGASLNLSLTAEHSGJJYSCSAD 822 
2481 AGGTCGTCCCCCTCTGGAGGAGCGTCCTTAAACCTCTCTCTGACTGCAGAGCACTCTGGAAACTACTCCTGTGAGGCCGA 

NGLGAQRSE7 7 7LYI7GL7ANRSGPFA 849 
2561 CAATGGCCTCGGGGCCCAGCGCAGTGAGACAGTGACACTTTATATCACAGGGCTGACCGCGAACAGAAGTGGCCCTTTTG 

T G V A G G L L S ~ A G L A A G A L L 1 Y C W L S R 875 
2641 CCACAGGAGTCGCCGGGGGCCTGCTCAGCATAGCAGC-CCTTGCTGCGGGGGCACTGCTC-CTCTACTGCTGGCTCrCGAGA 

KAGRKPASDP.-. RSPSDSDSQEPT Y H M V 902 
2721 AAAGCAGGGAGAAAGCCTGCCTCTGACCCCGCCAGGAGCCCTTCAGACTCGGACTCCCAAGAGCCCACCTATCACAATGT 

PAWEELQPVY7NANPRGENVV Y S E V R I 929 
2 S 0 1 ACCAGCC7GGGAAGAGC7GCAACCAGTG7ACAC7AATGCAAATCC7AGAGGAGAAAATG7GG77TAC7CAGAAG7ACGGA 

I Q E K K K H A V A S O ? R K L R N K G S P I 1 Y § 9 55 

28 81 7CATCCAAGAGAAAAAGAAACA7GCAG7GGCC7C7GACCCCAGGC A7C7CAGGAACAAGGG77CCCC7A7CA7C7ACTC7 
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FIGURE 6c -2 



E V KVASTPVSGSLFLASSAPHR* Stop 

2961 GAAGTTAAGGTGGCGTCAACCCCGGTTTCCGGATCCCTGTrCTTGGCTTCCTCAGCTCCTCACAGATGAGTCCACACGTC 

3 041 TCTCCAACTGCTGTTTCAGCCTCTGCACCCCAAAGTTCCCCTTGGGGGAGAAGCAGCATTGAAGTGGGAAGATTTAGGCT 

3201 TCACCTGTTTCCCAGTTAAAGCCCTGACTGGCAGGTTTTTTAATCCAGTGGCAAGGTGCTCCCACTCCAGGGCCCAGCAC 

3281 ATCTCCTGGATTCCTTAGTGGGCTTCAGCTGTGGTTGCTGTTCTGAGTACTGCTCTCATCACACCCCCACAGAGGGGGTC 

3361 TTACCACACAAAGGGAGAGTGGGCCTTCAGGAGATGCCGGGCTGGCCTAACAGCTCAGGTGCTCCTAAACTCCGACACAG 

3441 AGTTCCTGCTTTGGGTGGATGCATTTCTCAATTGTCATCAGCCTGGTGGGGCTACTGCAGTGTGCTGCCAAATGGGACAG 

3521 CACACAGCCTGTGCACATGGGACATGTGATGGGTCTCCCCACGGGGGCTGCATTTCACACTCCTCCACCTGTCTCAAACT 

3681 TACACCAAGTCCCGAAGTTCTTCCCATCCTAACAGAAGTAACCCAGCAAGTCAAGGCCAGGAGGACCAGGGGTGCAGACA 

3761 GAACACATACTGGAACACAGGAGGTGCTCAATTACTATTTGACTGACTGACTGAATGAATGAATGAATGAGGAAGAAAAC 

3841 TGTGGGTAATCAAACTGGCATAAAATCCAGTGCACTCCCTAGGAAATCCG<^AGGTATTCTGGCTTCCTAAGAAACAACG 

3921 GAAGAGAAGGAGCTTGGATGAAGAAACTGTTCAGCAAGAAGAAGGGCTTCTT<^CACTTITATGTGCTTGTGGATCACCT 

4081 ATGCTGATGCTGCTGGCCCTGGGACTGCACTGCATGCATGTGAAGCCCTATAGGTCTCAGCAGAGGCCCATGGAGAGGGA 

3GCTTTCATCAGCCACCCTGGGAAGCGGAAAGTAGC 

4321 TGCCACTATCTTTGTTTCCCCACCTCAGGCCTCACACTTTCCCATGAAAAGGGTGAATGTATATAACCTGAGCCCTCTCC 

4401 ATTCAGAGTTGTTCTCCCATCTCTGAGCAATGGGATGTTCTGTTCCGCTTTTATGATATCCATCACATCTTATCTTGATC 

4481 TTTGCTCCCAGTGGATTGTACAGTGATGACTTT T AAGCCCCACGGCCCTGAAATAAAATCCTTCCAAGGGCATTGGAAGC 

4641 CAACCTTTCAAGGAGGAAACCAGACACCTGAGACAGGAGCCTGTATGCAGCCCAGTGCAGCCTTGCAGAGGACAAGGCTG 

4721 GAGGCATTTGTCATCACTACAGATATGCAACTAAAATAGACGTGGAGCAAGAGAAATGCATTCCCACCGAGGCCGCTTTT 

4801 TTAGGCCTAGTTGAAAGTCAAGAAGGACAGCAGCAAGCATAGGCTCAGGATTAAAGAAAAAAATCTGCTCACAGTCTGTT 

4881 CTGGAGGTCACATCACCAACAAAGCrCACGCCCTATGCAGTTCTGAGAAGGTGGAGGCACCAGGCTCAAAAGAGGAAATT 

4961 TAGAATTTCTCATTGGGAGAGTAAGGTACCCCCATCCCAGAATGATAACTGCACAGTGGCAGAACAAACTCCACCCTAAT 

5041 GTGGGTGGACCCCATCCAGTCTGTTGAAGGCCTGAATGTAACAAAAGGGCTTATTCTTCCTCAAGTAAGGGGGAACTCCT 

5121 GCTTTGGGCTGGGACATAAGTTTTTCTGCTTTCAGACGCAAACTGAAAAATGGCTCTTCTTGGGTCTTGAGCTTGCTGGC 

5201 ATATGGACTGAAAGAAACTATGCTATTGGATCTCCTGGATCTCCAGCTTGCTGACTGCAGATCTTGAGATATGTCAGCCT 

5281 CTACAGTCACAAGAGCTAATTCATTCTAATAAACCAATCTTTC 
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FIGURE 18A 



1 CTCAATCAGCTTTATGCAGAGAAGAAGCTTACTGAGCTCACTGCTGGTGCTGGTGTAGGCAAGTGCTGCTTTGGCAA 

M L L W A S 6 

78 TCTGGGCTGACCTGGCTTGTCTCCTCAGAACTCCTTCTCCAACCCTGGAGCAGGCTTCCATGCTGCTGTGGGCGTCC 

LLAFAPVCG^SAAAHKPVISVHFPWT 32 
155 TTGCTGGCCTTTGCTCCAGTCTGTGGACAATCTGCAGCTGCACACAAACCTGTGATTTCCGTCCATCCTCCATGGAC 

TFFKGERVTLTCNGFQFYATEKTTWY58 
232 CACATTCTTCAAAGGAGAGAGAGTGACTCTGACTTGCAATGGATTTCAGTTCTATGCAACAGAGAAAACAACATGGT 

HRHYWGEKLTLTPGNTLEVRESGLY 83 
309 ATCATCGGCACTACTGGGGAGAAAAGTTGACCCTGACCCCAGGAAACACCCTCGAGGTTCGGGAATCTGGACTGTAC 

RCQARGSPRSNPVRLLFSSDSLILQA 109 
3 86 AGATGCCAGGCCCGGGGCTCCCCACGAAGTAACCCTGTGCGCTTGCTCTTTTCTTCAGACTCCTTAATCCTGCAGGC 

PYSVFEGDTLVLRCHRRRKEKLTAVK13 5 
463 ACCATATTCTGTGTTTG^^CfflgACATTGGTTCTGAGATGCCACAGAAGAAGGAAAGAGAAATTGACTGCTGTGA 

Y T W N G l^^^j SISNKSWDLLIPQASSN 160 
540 AATATACTTGGAATGGAAACATTCTTTCCATTTCTAATAAAAGCTGGGATCTTCTTATCCCACAAGCAAGTTCAAAT 

NNGNYRCIGYGDENDVFRSNFKIIKI 186 
617 AACAATGGCAATTATCGATGCATTGGATATGGAGATGAGAATGATGTATTTAGATCAAATTTCAAAA^^mAaAT 

QELFPHPELKATDSQPTEGNSV Bggf^^gl C 212 
694 TCAAGAACTATTTCCACATCCAGAGCTGAAAGCTACAGACTCTCAGCCTACAGAGGGGAATTCTGTAAACCTGAGCT 

ETQLPPERSDTPLHFNFFRDGEVIL 237 
771 GTGAAACACAGCTTCCTCC AGAGCGGTCAGACACCCCACTTCACTTCAACTTCTTCAGAGATGGCGAGGTCATCCTG 

SDWSTYPELQLPTVWRENSGSYWCGA 263 
848 TCAGACTGGAGCACGTACCCGGAACTCCAGCTCCCAACCGTCTGGAGAGAAAACTCAGGATCCTATTGGTGTGGTGC 

ETVRGNIHKHSPSLQIHVQRI PVSGV 289 
925 TGAAACAGTGAGGGGTAACATCCACAAGCACAGTCCCTCGCTACAGATCCATGTGCAGCGGATCCCTGTGTCTGGGG 

LLETQPSGGQAVEGEMLVLVCSVAE 314 
1002 TGCTCCTGGAGACCCAGCCCTCAGGGGGCCAGGCTGTTGAAGGGGAGATGCTGGTCCTTGTCTGCTCCGTGGCTGAA 

GTGDTTFSWHREDMQESLGRKTQRSL 340 
1079 GGCACAGGGGATACCACATTCTCCTGGCACCGAGAGGACATGCAGGAGAGTCTGGGGAGGAAAACTCAGCGTTCCCT 

RAELELPAIRQSHAGGYYCTADNSYG 366 
1156 GAGAGCAGAGCTGGAGCTCCCTGCCATCAGACAGAGCCATGCAGGGGGATACTACTGTACAGCAGACAACAGCTACG 

P V Q S M V L ^S^^^^ V RETPGNRDG LVAAG 391 
1233 GCCCTGTCCAGAGCATGGTGCTGAATGTCACTGTGAGAGAGACCCCASGCAACAGAGATGGCCTTGTCGCCGCGGGA 

A T G a T, L S A L I, L A V A L T, F H C W R R R K S G 417 
1310 GCCACTGGAGGGCTGCTCAGTGCTCTTCTCCTGGCTGTGGCCCTGCTGTTTCACTGCTGGCGTCGGAGGAAGTCAGG 

VGFLGDETRLPPAPGPGESSHSICPA 443 
1387 AGTTGGTTTCTTGGGAGACGAAACCAGGCTCCCTCCCGCTCCAGGCCCAGGAGAGTCCTCCCATTCCATCTGCCCTG 

Q V E L Q S L Y V D...V HPKKGDLV Y S E I Q T 468 
1464 CCCAGGTGGAGCTTCAGTCGTTGTATGTTGATGTACACCCCAAAAAGGGAGATTTGGTATACTCTGAGATCCAGACT 

TQLGEEEEANTSRTLLEDKDVSVV J£ § *9 4 

1541 ACTCAGCTGGGAGAAGAAGAGGAAGCTAATACCTCCAGGACACTTCTAGAGGATAAGGATGTCTCAGTTGTCTACTC 

g KTQHPDNSAGK1SSKDEES* 515 

1618 TGAGGTAAAGACACAACACCCAGATAACTCAGCTGGAAAGATCAGCTCTAAGGATGAAGAAAGTTAAGAGAATGAAA 
1695 AGTTACGGGAACGTCCTACTCATGTGATTTCTCCCTTGTCCAAAGTCCCAGGCCCAGTGCAGTCCTTGCGGCACCTG 
1772 GAATGATCAACTCATTCCAGCTTTCTAATTCTTCTCATGCATATGCATTCACTCCCAGGAATACTCATTCGTCTACT 
1849 CTGATGTTGGGATGGAATGGCCTCTGAAAGACTTCACTAAAATGACCAGGATCCACAGTTAAGAGAAGACCCTGTAG 
1926 TATTTGCTGTGGGCCTGACCTAATGCATTCCCTAGGGTCTGCTTTAGAGAAGGGGGATAAAGAGAGAGAAGGACTGT 
2003 TATGAAAAACAGAAGCACAAATTTTGGTGAATTGGGATTTGCAGAGATGAAAAAGACTGGGTGACCTGGATCTCTGC 
2080 TTAATACATCTACAACCATTGTCTCACTGGAGACTCACTTGCATCAGTTTGTTTAACTGTGAGTGGCTGCACAGGCA 
2157 CTGTGCAAACAATGAAAAGCCCCTTCACTTCTGCCTGCACAGCTTACACTGTCAGGATTCAGTTGCAGATTAAAGAA 
2234 CCCATCTGGAATGGTTTACAGAGAGAGGAATTTAAAAGAGG ACATCAGAAGAGCTGGAGATGCAAGCTCTAGGCTGC 
2311 GCTTCCAAAAGCAAATGATAATTATGTTAATGTCATTAGTGACAAAGATTTGCAACATTAGAGAAAAGAGACACAAA 
2388 TATAAAATTAAAAACTTAAGTACCAACTCTCCAAAACTAAATTTGAACTTAAAATATTAGTATAAACTCATAATAAA 
• CTCTGCCTTTAAATAAAAAAAAAAAAAAAAAAAAA 



IRTA1 cDNA and protein sequence 
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2 B 1 CGGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTTGGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTG 

MLLWVILLVLAPVSG^QFARTPR 22 
81 GTGGTCCAGGTCTTCATGCTGCTGTGGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAG 

PIIFLQPPWTTVFQGERVTLTCKGFRF49 
161 GCCCATTATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAAGGGATTTCGCT 

YSPQKTKWYHRYLGKEILRETPDNIL 75 
241 TCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAATACTAAGAGAAACCCCAGACAATATCCTT 

EVQESGEYRCQAQGSPLSSPVHLDFSS 102 
321 GAGGTTCAGGAATCTGGAGAGTACAGATGCCAGGCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTC 

ASLILQAPLSVFEGDSVVLRCRAKAEV129 
401 AGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGACTCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAG 

T I j^-jaTC-T- IVKNDNVLAFLNKRTDFHIPH 155 
481 TAACAC^*^»SS*CTATTTACAAGAATGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCATATTCCTCAT 

ACLKDNGAYRCTGYKESCCPVSSNTVK 182 
561 GCATGTCTCAAGGACAATGGTGCATATCGCTGTACTGGATATAAGGAAAGTTGTTGCCCTGTTTCTTCCAATACAGTCAA 

IQVQEPFTRPVLRASSFQPISGNPVTL 209 
641 AATCCAAGTCCAAGAGCCATTTACACGTCCAGTGCTGAGAGCCAGCTCCTTCCAGCCCATCAGCGGGAACCCAGTGACCC 

TCETQLSLERSDVPLRFRFFRDDQTL 235 
721 TGACCTGTGAGACCCAGCTCTCTCTAGAGAGGTCAGATGTCCCGCTCCGGTTCCGCTTCTTCAGAGATGACCAGACCCTG 

GLGWSLSPNFQITAMWSKDSGFYWCKA 262 
801 GGATTAGGCTGGAGTCTCTCCCCGAATTTCCAGATTACTGCCATGTGGAGTAAAGATTCAGGGTTCTACTGGTGTAAGGC 

ATMPHSVISDSPRSWIQVQIPASHPVL 289 
881 AGCAACAATGCCTCACAGCGTCATATCTGACAGCCCGAGATCCTGGATACAGGTGCAGATCCCTGCATCTCATCCTGTCC 

TLSPEKALNFEGTKVTLHCETQEDSL 315 
961 TCACTCTCAGCCCTGAAAAGGCTCTGAATTTTGAGGGAACCAAGGTGACACTTCACTGTGAAACCCAGGAAGATTCTCTG 

RTLYRFYHEGVPLRHKSVRCERGASI S 342 
1041 CGCACTTTGTACAGGTTTTATCATGAGGGTGTCCCCCTGAGGCACAAGTCAGTCCGCTGTGAAAGGGGAGCATCCATCAG 

FSLTTENSGNYYCTADNGLGAKPSKAV 369 
1121 CTTCTCACTGACTACAGAGAATTCAGGGAACTACTACTGCACAGCTGACAATGGCCTTGGCGCCAAGCCCAGTAAGGCTG 

SLSVTVPVSHPVL £RBH| S PEDLIFEGA 395 
1201 TGAGCCTCTCAGTCACTGTTCCCGTGTCTCATCCTGTCCT(»^^^^;TCTCCTGAGGACCTGATTTTTGAGGGAGCC 

KVTLHCEAQRGSLPILYQFHHEDAALE 422 
12 81 AAGGTGACACTTCACTGTGAAGCCCAGAGAGGTTCACTCCCCATCCTGTACCAGTTTCATCATGAGGATGCTGCCCTGGA 

RRSANSAGGVAI SFSLTAEH SGNYYCT 449 
1361 GCGTAGGTCGGCCAACTCTGCAGGAGGAGTGGCCATCAGCTTCTCTCTGACTGCAGAGCATTCAGGGAACTACTACTGCA 



1521 ACCCTCAGCTCTGCTGAGGCCCTGACTTTTGAAGGAGCCACTGTGACACTTCACTGTGAAGTCCAGAGAGGTTCCCCACA 
ILYQFYHEDMPLWSSSTPSVGRVSFSF 
1601 AATCCTATACCAGTTTTATCATGAGGACATGCCCCTGTGGAGCAGCTCAACACCCTCTGTGGGAAGAGTGTCCTTCAGCT 

SLTEGHSGNYYCTADNGFGPQRSEVV 
1681 TCTCTCTGACTGAAGGACATTCAGGGAATTACTACTGCACAGCTGACAATGGCTTTGGTCCCCAGCGCAGTGAAGTGGTG 



2A r 2C1761 



IiTLRVPRAQAVVGD 



GKCWVLASHPPLAEFSLTHSFK 582 
2B 1761 GGTAAGTGCTGGGTTCTTGCCAGTCACCCACCCCTGGCTGAGTTCTCTCTCACCCATTCCTTTAA 

LELHCEAPRGSPPILYWFYHEDVTLGS 609 
2A,2C1841 GCTGGAGCTTCACTGTGAGGCCCCGAGAGGCTCTCCCCCAATCCTGTACTGGTTTTATCATGAGGATGTCACCCTGGGGA 

NLFALSSFLP* stop 592 
2B 1841 AAATCTGTTTGCACTGTCCAGTTTCCTCCCCTAATCAACTTAATCCCCTTCTTGGCTTCCTCCTCAACTAACTAGCTGGG 

SSAPSGGEASF KTS^S L T A E H S G E 635 

2A.2C1921 GCAGCTCAGCCCCCTCTGGAGGAGAAGCTTCTTTI^CCTCTCTCTGACTGCAGAACATTCTGGA^K^SroiTGTGAG 
2B 1921 GTTTTCCGTACTCATAAGTCCTGGCTCAGCCAGACCCCTAAAACAGCTCAGTAGATTCCCCAGCTTTTACCAAATGAATT 



2A,2C2001 



L V A Q H S D 



FRAPRAQAVVGDLLELHCEALRGSSPI 689 
2A,2C2081 CTTCAGGGCTCCCAGGGCCCAGGCTGTGGTGGGGGACCTGCTGGAGCTTCACTGTGAGGCCCTGAGAGGCTCCTCCCCAA 
2B 2 081 TCTCTACTGACATTTACATATTAACTTAGCTACAAGCACAGTCTTATAGATAAATATTGGTCAAGACCTTAAATTCTCCA 
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FIGURE 18B-2 



LYWFYHEDVTLGKISAPS.. GGGASF IsfSa^ 715 
2A.2C2161 TCCTGTACTGGTTTTATCATGAAGATGTCACCCTGGGTAAGATCTCAGCCCCCTCTGGAGGAGGGGCCTCCTTOJ^eSSI: 
2B 2161 AAGGATTTCCAATCTTATGGTAGATTTGGAGAAAGCTGCTGGTGAACAAAGGGGGAAATGGCTCCCTAGGAACCAACTCC 

LTTEHSGIYSCEADNGLEAQRSEMVT 7 42 
2A , 2C22 4 1 TCTCTGACTACAGAACATTCTGGAATCTACTCCTGTGAGGCAGACAATGGTCTGGAGGCCCAGCGCAGTGAGATGGTGAC 
2 B 2241 TCAAACTTCTGGAGTTTTTATGATCCCTTGTTTTCTAACCTGCTAAAATCAGTATCATTTTATTGTATTATTTTAAAAAA 

LKVAVPVSRPVLTLRAPGTHAAVGDLL 769 
2 C 2321 ACTGAAAGTTGCAGTTCCGGTGTCTCGCCCGGTCCTCACCCTCAGGGCTCCCGGGACCCATGCTGCGGTGGGGG ACCTGC 

G EWALPTSSTSEN* 759 

2A 2321 GGTGAGTGGGCCCTGCCCACCAGCAGCACATCTGAGAACTGACTGTGCCTGTTCTCCCTGCAGCTGA 

2 B 2321 ACTATTGTTGAAGTATGACATACATTCAAGAAACGTGTGCAAATTGTATGTGTACGATTTGGTGTCTTTTTAGGAGCTAA 

ELHCEALRGSPLILYRFFHEDVTLG j§| 795 
2C 2 401 TGGAGCTTCACTGTGAGGCCCTGAGAGGCTCTCCCCTGATCCTGTACCGGTTTTTTCATGAGGATGTCACCCTAGGJI^Sr 
2A 2401 AAATGGAGCCACAGAGCTCCTCAGGGCTGTTTGCTTGTGTGGCATCCCAGCACACTTCCTGCCTGCAGAACCTCCCTGTG 

Wy-'is P S G G A S L £rip;s"- L T A E H S G Sf^fx E A D 822 
2 C 2481 ^G^TOTCCCCCTCTGGAGGAGCGTCCTTA^^^^CTCTGACTGCAGAGCACTCTGGA HgFATTO 2 T GTGAGGCCGA 
2A 2481 AAAGTCTCGGATCCTTTGTGGTATGGTTCCAGGAATCTGATGTTTCCCAGCAGTCTTCTTGAAGATGATCAAAGCACCTC 
2 B 2481 TGATAGAAAAATCTTGAGCCTGATGTGTCAGACATGCCCCTAGCATAACTTGTTGAGTAAAGAGGTTATTTTTAAAATGT 

NGLGAQRSETVTLYITGLTA fS^R^S.' G P F A 849 
2C 2 561 CAATGGCCTCGGGGCCCAGCGCAGTGAGACAGTGACACTTTATATCACAGGGCTGACCGCG^^A^^TGGCCCTTTTG 
2 A 2 561 ACTAAAAATGCAAATAAGACTTTTTTAGAACATAAACTATATTCTGAACTGAAATTATTACATGAAAATGAAACCAAAGA 
2B 2 561 GAATGTTCTGAGACTACTCCAAAGTCAGAGCCAAATCTACTAGGAAGCTTCTAGACTTCACTCATTCTGCATCCCATTAC 

T G V A G G L L S I A G L A A G A I, T, L Y P W T., S R 875 
2C 2 641 CCACAGGAGTCGCCGGGGGCCTGCTCAGCATAGCAGGCCTTGCTGCGGGGGCACTGCTGCTCTACTGCTGGCTCTCGAGA 
2A 2641 ATTCTGAGCATATGTTTCTCTGCCGTAGAAAGGATTAAGCTGTTTCTTGTCCGGATTCTTCTCTCATTGACTTCTAAGAA 
2B 2 641 TATCTTTTTATCCATGTTTTACTTTCTTCTCATATTCAGCAGCATCTTAAGCCTCTTTATTTTCTGTTTCTTGACTGTCA 

KAGRKPASDPARSPSDSDSQEPT Y H N V 902 
2C 2721 AAAGCAGGGAGAAAGCCTGCCTCTGACCCCGCCAGGAGCCCTTCAGACTCGGACTCCCAAGAGCCCACCTATCACAATGT 
2 A 2721 GCCTCTACTCTTGAGTCTCTTTCATTACTGGGGATGTAAATGTTCCTTACATTTCCAC ATTAAA AATCCTATGTTAACGA 
2 B 2721 CCCTTAATGCCAGTAGAATGTAAGCTTCATGAGAACAGAACTGCATCCATCTTGGTCTTCACAACATCCCTGTGCCTACT 

PAWEELQPVYTNANPRGENVV Y S E V R I 929 
2 C 2801 ACCAGCCTGGGAAGAGCTGCAACCAGrGTACACTAATGCAAATCCTAGAGGAGAAAATGTGGTTTACTCAGAAGTACGGA 
2A 2801 AAAAA 

2B 2 801 CAGTGTTTGGCACACAGTAGGTCCTCAGTCAACATTTGTAATTTAGTGGACAGATGATATGACAAGATGATAAGAGGGGA 

IQEKKKHAVASDPRHLRNKGSPI I Y. S 955 

2 C 2881 TCATCCAAGAGAAAAAGAAACATGCAGTGGCCTCTGACCCCAGGCATCTCAGGAACAAGGGTTCCCCTATCATCTACTCT 
2 B 2881 TTTAAAAAAATCATCTAGCAAAGCCCAAGAGGAAAAAAAACAAAGCTATTTTAGAAATGAAATACCAATTTGAAGCAGTA 

g \J KVASTPVSGSLFLASSAPHR* Stop 977 

2C 2 961 GAAGTTAAGGTGGCGTCAACCCCGGTTTCCGGATCCCTGTTCTTGGCTTCCTCAGCTCCTCACAGATGAGTCCACACGTC 
2B 2961 AGAATAGATTGGATATCTTTGAAAACCATTAATTGAATGAAGAACCAATTTGAGAAAACAATACAGAATGCAAAGTAGAA 

2B 3041 AGATACAGAAATAAAGGCAAAAGTTATAATATGGAAATCAGACAATGGATTTGTCTGTATCCAGTTATGTGGATAATTAA 

2C 3121 GCCCCAGACCATATCTACTGGCCTTTGTTTCACATGTCCTCATTCTCAGTCTGACCAGAATGCAGGGCCCTGCTGGACTG 
2 B 3121 AATGGAGACCCTCAGAAAATTGAACCGAAGAGTAAAATGAAACTCAAAAATGTAGTAGAAATTGTTGGGAAGTAAAGAAA 

2C 3201 TCACCTGTTTCCCAGTTAAAGCCCTGACTGGCAGGTTTTTTAATCCAGTGGCAAGGTGCTCCCACTCCAGGGCCCAGCAC 
2B 32 01 ACTTGAATATGTAGATCAGAACATATATGTTGATGACGTTATTGACTTTGAGGTTAAAAATATATATATGTGCCTATGAT 

2C 3281 ATCTCCTGGATTCCTTAGTGGGCTTCAGCTGTGGTTGCTGTTCTGAGTACTGCTCTCATCACACCCCCACAGAGGGGGTC 
2B 32 81 TATGGGGAAAAAAGCAGTCGTCTCAGAAAGAAAAACATCAAGTTAGTCTTAGACTTTGCAGTGCACTCAGTACCAAAGAG 

3361 TTACCACACAAAGGGAGAGTGGGCCTTCAGGAGATGCCGGGCTGGCCTAACAGCTCAGGTGCTCCTAAACTCCGACACAG 

3441 AGTTCCTGCTTTGGGTGGATGCATTTCTCAATTGTCATCAGCCTGGTGGGGCTACTGCAGTGTGCTGCCAAATGGGACAG 

3521 CACACAGCCTGTGCACATGGGACATGTGATGGGTCTCCCCACGGGGGCTGCATTTCACACTCCTCCACCTGTCTCAAACT 

3 601 CTAAGGTCGGCACTTGACACCAAGGTAACTTCTCTCCTGCTCATGTGTCAGTGTCTACCTGCCCAAGTAAGTGGCTTTCA 

3 681 TACACCAAGTCCCGAAGTTCTTCCCATCCTAACAGAAGTAACCCAGCAAGTCAAGGCCAGGAGGACCAGGGGTGCAGACA 

37 61 GAACACATACTGGAACACAGGAGGTGCTCAATTACTATTTGACTGACTGACTGAATGAATGAATGAATGAGGAAGAAAAC 

3841 TGTGGGTAATCAAACTGGCATAAAATCCAGTGCACTCCCTAGGAAATCCGGGAGGTATTCTGGCTTCCTAAGAAACAACG 

3921 GAAGAGAAGGAGCTTGGATGAAGAAACTGTTCAGCAAGAAGAAGGGCTTCTTCACACTTTTATGTGCTTGTGGATCACCT 

4001 GAGGATCTGTGAAAATACAGATACTGATTCAGTGGGTCTGTGTAGAGCCTGAGACTGCCATTCTAACATGTTCCCAGGGG 
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4081 ATGCTGATGCTGCTGGCCCTGGGACTGCACTGCATGCATGTGAAGCCCTATAGGTCTCAGCAGAGGCCCATGGAGAGGGA 

4161 ATGTGTGGCTCTGGCTGCCCAGGGCCCAACTCGGTTCACACGGATCGTGCTGCTCCCTGGCCAGCCTTTGGCCACAGCAC 

4241 CACCAGCTGCTGTTGCTGAGAGAGCTTCTTCTCTGTGAC ATGTTGGCTTTCATCAGCCACCCTGGGAAGCGGAAAGTAGC 

4321 TGCCACTATCTTTGTTTCCCCACCTCAGGCCTCACACTTTCCCATGAAAAGGGTGAATGTATATAACCTGAGCCCTCTCC 

4401 ATTCAGAGTTGTTCTCCCATCTCTGAGCAATGGGATGTTCTGTTCCGCTTTTATGATATCCATCACATCTTATCTTGATC 

4481 TTTGCTCCCAGTGGATTGTACAGTGATGACTTTTAAGCCCCACGGCCCTGAAATAAAATCCTTCCAAGGGCATTGGAAGC 

4561 TCACTCCACCTGAACCATGGCTTTTCATGCTTCCAAGTGTCAGGGCCTTGCCCAGATAGACAGGGCTGACTCTGCTGCCC 

4641 CAACCTTTCAAGGAGGAAACCAGACACCTGAGACAGGAGCCTGTATGCAGCCCAGTGCAGCCTTGCAGAGGACAAGGCTG 

4721 GAGGCATTTGTCATCACTACAGATATGCAACTAAAATAGACGTGGAGCAAGAGAAATGCATTCCCACCGAGGCCGCTTTT 

4801 TTAGGCCTAGTTGAAAGTCAAGAAGGACAGCAGCAAGCATAGGCTCAGGATTAAAGAAAAAAATCTGCTCACAGTCTGTT 

4881 CTGGAGGTCACATCACCAACAAAGCTCACGCCCTATGCAGTTCTGAGAAGGTGGAGGCACCAGGCTCAAAAGAGGAAATT 

4961 TAGAATTTCTCATTGGGAGAGTAAGGTACCCCCATCCCAGAATGATAACTGCACAGTGGCAGAACAAACTCCACCCTAAT 

5041 GTGGGTGGACCCCATCCAGTCTGTTGAAGGCCTGAATGTAACAAAAGGGCTTATTCTTCCTCAAGTAAGGGGGAACTCCT 

5121 GCTTTGGGCTGGGACATAAGTTTTTCTGCTTTCAGACGCAAACTGAAAAATGGCTCTTCTTGGGTCTTGAGCTTGCTGGC 

5201 ATATGGACTGAAAGAAACTATGCTATTGGATCTCCTGGATCTCCAGCTTGCTGACTGCAGATCTTGAGATATGTCAGCCT 

5281 CTACAGTCACAAGAGCTAATTCATTCT AATAAA CCAATCTTTC 
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1 AGTGAAGGGGTTTCCCATATGAAAAATACAGAAAGAATTATTTGAATACTA 
52 GCAAATACACAACTTGATATTTCTAGAGAACCCAGGCACAGTCTTGGAGAC 
103 ATTACTCCTGAG AGACTGCAGCTGATGGAAGATGAGCCCCAACTTCTAAAA 
154 ATGTATCACTACCGGGATTGAGATACAAACAGCATTTAGGAAGGTCTCATC 
205 TGAGTAGCAGCTTCCTGCCCTCCTTCTTGGAGATAAGTCGGGCTTTTGGTG 

2 56 AGACAGACTTTCCCAACCCTCTGCCCGGCCGGTGCCCATGCTTCTGTGGCT 

1 M L L W L 

3 07 GCTGCTGCTGATCCTGACTCCTGGAAGAGAACAATCAGGGGTGGCCCCAAA 

6LLLILTPGREQSGVAPK 
3 58 AGCTGTACTTCTCCTCAATCCTCCATGGTCCACAGCCTTCAAAGGAGAAAA 

23AVLLLNPPWSTAFKGEK 
409 AGTGGCTCTCATATGCAGCAGCATATCACATTCCCTAGCCCAGGGAGACAC 

40 VALICSSISHSLAQGDT 
460 ATATTGGTATCACGATGAGAAGTTGTTGAAAATAAAACATGACAAGATCCA 

57YWYHDEKLLKIKHDKIQ 
511 AATTACAGAGCCTGGAAATTACCAATGTAAGACCCGAGGATCCTCCCTCAG 

74ITEPGNYQCKTRGSSLS 
562 TGATGCCGTGCATGTGGAATTTTCACCTGACTGGCTGATCCTGCAGGCTTT 

91DAVHVEFSPDWLILQAL 
613 ACATCCTGTCTTTGAAGGAGACAATGTCATTCTGAGATGTCAGGGGAAAGA 
108 HPVFEGDNVILRCQGKD 
664 CAACAAAAACACTCATCAAAAGGTTTACTACAAGGATGGAAAACAGCTTCC 
125 NKNTHQKVYYKDGKQLP 
715 TAATAGTTATAATTTAGAGAAGATCACAGTGAATTCAGTCTCCAGGGATAA 
142 NSYNLEKITVNSVSRDN 
766 TAGCAAATATCATTGTACTGCTTATAGGAAGTTTTACATACTTGACATTGA 
159 SKYHCTAYRKFYILDIE 
817 AGTAACTTCAAAACCCCTAAATATCCAAGTTCAAGAGCTGTTTCTACATCC 
176 VTSKPLNIQVQELFLHP 
868 TGTGCTGAGAGCCAGCTCTTCCACGCCCATAGAGGGGAGTCCCATGACCCT 
193 VLRASSSTPIEGSPMTL 
919 GACCTGTGAGACCCAGCTCTCTCCACAGAGGCCAGATGTCCAGCTGCAATT 
210 TCETQLSPQRPDVQLQF 
970 CTCCCTCTTCAGAGATAGCCAGACCCTCGGATTGGGCTGGAGCAGGTCCCC 
227 SLFRDSQTLGLGWSRSP 

1021 CAGACTCCAGATCCCTGCCATGTGGACTGAAGACTCAGGGTCTTACTGGTG 
244 RLQIPAMWTEDSGSYWC 

1072 TGAGGTGGAGACAGTGACTCACAGCATCAAAAAAAGGAGCCTGAGATCTCA 
261 EVETVTHSIKKRSLRSQ 

1123 GATACGTGTACAGAGAGTCCCTGTGTCTAATGTGAATCTAGAGATCCGGCC 
278 IRVQRVPVSNVNLEIRP 

1174 CACCGGAGGGCAGCTGATTGAAGGAGAAAATATGGTCCTTATTTGCTCAGT 
295 TGGQLIEGENMVLICSV 

1225 AGCCCAGGGTTCAGGGACTGTCACATTCTCCTGGCACAAAGAAGGAAGAGT 
312 AQGSGTVTFSWHKEGRV 

1276 AAGAAGCCTGGGTAGAAAGACCCAGCGTTCCCTGTTGGCAGAGCTGCATGT 
329 RSLGRKTQRSLLAELHV 

1327 TCTCACCGTGAAGGAGAGTGATGCAGGGAGATACTACTGTGCAGCTGATAA 
346 LTVKESDAGRYYCAADN 

1378 CGTTCACAGCCCCATCCTCAGCACGTGGATTCGAGTCACCGTGAGAATTCC 
363 VHSPILSTWIRVTVRIP 

1429 GGTATCTCACCCTGTCCTCACCTTCAGGGCTCCCAGGGCCCACACTGTGGT 
3 80 VSHPVLTFRAPRAHTVV 

1480 GGGGGACCTGCTGGAGCTTCACTGTGAGTCCCTGAGAGGCTCTCCCCCGAT 
397 GDLLELHCESLRGSPPI 

1531 CCTGTACCGATTTTATCATGAGGATGTCACCCTGGGGAACAGCTCAGCCCC 
414 LYRFYHEDVTLGNS SAP 

1582 CTCTGGAGGAGGAGCCTCCTTCAACCTCTCTCTGACTGCAGAACATTCTGG 
431 SGGGASFNLSLTAEHSG 

1633 AAACTACTCCTGTGATGCAGACAATGGCCTGGGGGCCCAGCACAGTCATGG 
448 NYSCDADNGLGAQHSHG 

1684 AGTGAGTCTCAGGGTCACAGTTCCGGTGTCTCGCCCCGTCCTCACCCTCAG 
465 VSLRVTVPVSRPVLTLR 

173 5 GGCTCCCGGGGCCCAGGCTGTGGTGGGGGACCTGCTGGAGCTTCACTGTGA 
483 APGAQAVVGDLiLELHCE 

1786 GTCCCTGAGAGGCTCCTTCCCGATCCTGTACTGGTTTTATCACGAGGATGA 
499 SLRGSFPILYWFYHEDD 

1837 CACCTTGGGGAACATCTCGGCCCACTCTGGAGGAGGGGCATCCTTCAACCT 
516 TLGNISAHSGGGASFNL 

1888 CTCTCTGACTACAGAACATTCTGGAAACTACTCATGTGAGGCTGACAATGG 
533 SLTTEHSGNYSCEADNG 
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1939 CCTGGGGGCCCAGCACAGTAAAGTGGTGACACTCAATGTTACAGGAACTTC 

550 LGAQHSKVVTLNVTGTS 
1990 CAGGAACAGAACAGGCCTTACCGCTGCGGGAATCACGGGGCTGGTGCTCAG 

567 RNRTGLTAAGITGLVLS 
2041 CATCCTCGTCCTTGCTGCTGCTGCTGCTCTGCTGCATTACGCCAGGGCCCG 

584 ILVLAAAAALLHYARAR 
2 092 AAGGAAACCAGGAGGACTTTCTGCCACTGGAACATCTAGTCACAGTCCTAG 

601 RKPGGLSATGTSSHSPS 
2143 TGAGTGTCAGGAGCCTTCCTCGTCCAGGCCTTCCAGGATAGACCCTCAAGA 

618 ECQEPSSSRPSRIDPQE 
2194 GCCCACTCACTCTAAACCACTAGCCCCAATGGAGCTGGAGCCAATGTACAG 

635 PTHSKPLAPMELEPMYS 
2245 CAATGTAAATCCTGGAGATAGCAACCCGATTTATTCCCAGATCTGGAGCAT 

652 NVNPGDSNPIYSQIWSI 
2296 CCAGCATACAAAAGAAAACTCAGCTAATTGTCCAATGATGCATCAAGAGCA 

669 QHTKENSANCPMMHQEH 
2347 TGAGGAACTTACAGTCCTCTATTCAGAACTGAAGAAGACACACCCAGACGA 

686 EELTVLYSELKKTHPDD 
2398 CTCTGCAGGGGAGGCTAGCAGCAGAGGCAGGGCCCATGAAGAAGATGATGA 

703 SAGEASSRGRAHEEDDE 
2449 AGAAAACTATGAGAATGTACCACGTGTATTACTGGCCTCAGACCACTAGCC 

720 ENYENVPRVLLASDH 

2551 TCTCTCCAACCACACATCATCCATCTCTCCAGACTCTGCCTCCTACGAGGC 
2 602 TGGGCTGCAGGGTATGTGAGGCTGAGCAAAAGGTCTGCAAATCTCCCCTGT 
2 653 GCCTGATCTGTGTGTTCCCCAGGAAGAGAGCAGGCAGCCTCTGAGCAAGCA 
2704 CTGTGTTATTTTCACAGTGGAGACACGTGGCAAGGCAGGAGGGCCCTCAGC 
2755 TCCTAGGGCTGTCGAATAGAGGAGGAGAGAGAAATGGTCTAGCCAGGGTTA 
2 805 CAAGGGCACAATCATGACCATTTGATCCAAGTGTGATCGAAAGCTGTTAAT 
2 857 GTGCTCTCTGTATAAACAATTTGCTCCAAATATTTTGTTTCCCTTTTTTGT 
2908 GTGGCTGGTAGTGGCATTGCTGATGTTTTGGTGTATATGCTGTATCCTTGC 
2959 TACCATATTGGG 
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1 TGGTGACCAAGAGTACATCTCTTTTCAAATAGCTGGATTAGGTCCTCATGC 
1 ML 
5 2 TGCTGTGGTCATTGCTGGTCATCTTTGATGCAGTCACTGAACAGGCAGATT 
19 LWSLLVIFDAVTEQADS 

103 CGCTGACCCTTGTGGCGCCCTCTTCTGTCTTCGAAGGAGACAGCATCGTTC 
36 LTLVAPSSVFEGDSIVL 

154 TGAAATGCCAGGGAGAACAGAACTGGAAAATTCAGAAGATGGCTTACCATA 
53 KCQGEQNWKIQKMAYHK 

205 AGGATAACAAAGAGTTATCTGTTTTCAAAAAATTCTCAGATTTCCTTATCC 
70 DNKELSVFKKFSDFLIQ 

256 AAAGTGCAGTTTTAAGTGACAGTGGTAACTATTTCTGTAGTACCAAAGGAC 
87 SAVLSDSGNYFCSTKGQ 

307 AACTCTTTCTCTGGGATAAAACTTCAAATATAGTAAAGATAAAAGTCCAAG 

104 LFLWDKTSNIVKIKVQE 

358 AGCTCTTTCAACGTCCTGTGCTGACTGCCAGCTCCTTCCAGCCCATCGAAG 
121 LFQRPVLTASSFQPIEG 

409 GGGGTCCAGTGAGCCTGAAATGTGAGACCCGGCTCTCTCCACAGAGGTTGG 
138 GPVSLKCETRLSPQRLD 

460 ATGTTCAACTCCAGTTCTGCTTCTTCAGAGAAAACCAGGTCCTGGGGTCAG 

155 VQLQFCFFRENQVLGSG 

511 GCTGGAGCAGCTCTCCGGAGCTCCAGATTTCTGCCGTGTGGAGTGAAGACA 
172 WSSSPELQISAVWSEDT 
562 CAGGGTCTTACTGGTGCAAGGCAGAAACGGTGACTCACAGGATCAGAAAAC 
189 GSYWCKAETVTHRIRKQ 
613 AGAGCCTCCAATCCCAGATTCACGTGCAGAGAATCCCCATCTCTAATGTAA 

206 SLQSQIHVQRIPISNVS 
664 GCTTGGAGATCCGGGCCCCCGGGGGACAGGTGACTGAAGGACAAAAACTGA 
223 LEIRAPGGQVTEGQKLI 
715 TCCTGCTCTGCTCAGTGGCTGGGGGTACAGGAAATGTCACATTCTCCTGGT 
240 LLCSVAGGTGNVTFSWY 
766 ACAGAGAGGCCACAGGAACCAGTATGGGAAAGAAAACCCAGCGTTCCCTGT 

257 REATGTSMGKKTQRSLS 
817 CAGCAGAGCTGGAGATCCCAGCTGTGAAAGAGAGTGATGCCGGCAAATATT 
274 AELEI PAVKESDAGKYY 
868 ACTGTAGAGCTGACAACGGCCATGTGCCTATCCAGAGCAAGGTGGTGAATA 
291 CRADNGHVPIQSKVVNI 
919 TCCCTGTGAGAATTCCAGTGTCTCGCCCTGTCCTCACCCTCAGGTCTCCTG 

308 PVRIPVSRPVLTLRSPG 
970 GGGCCCAGGCTGCAGTGGGGGACCTGCTGGAGCTTCACTGTGAGGCCCTGA 
325 AQAAVGDLLELHCEALR 

1021 GAGGCTCTCCCCCAATCTTGTACCAATTTTATCATGAGGATGTCACCCTTG 
342 GSPPILiYQFYHEDVTLG 
1072 GGAACAGCTCGGCCCCCTCTGGAGGAGGGGCCTCCTTCAACCTCTCTTTGA 

359 NSSAPSGGGASFNLSLT 
1123 CTGCAGAACATTCTGGAAACTACTCCTGTGAGGCCAACAACGGCCTGGGGG 

376 AEHSGNYSCEANNGLGA 
1174 CCCAGTGCAGTGAGGCAGTGCCAGTCTCCATCTCAGGACCTGATGGCTATA 

393 QCSEAVPVSISGPDGYR 
1225 GAAGAGACCTCATGACAGCTGGAGTTCTCTGGGGACTGTTTGGTGTCCTTG 

410 RDLMTAGVLWGLFGVLG 
1276 GTTTCACTGGTGTTGCTTTGCTGTTGTATGCCTTGTTCCACAAGATATCAG 

427 FTGVALLLYALFHKI SG 
1327 GAGAAAGTTCTGCCACTAATGAACCCAGAGGGGCTTCCAGGCCAAATCCTC 

444 ESSATNEPRGASRPNPQ 
1378 AAGAGTTCACCTATTCAAGCCCAACCCCAGACATGGAGGAGCTGCAGCCAG 

461 EFTYSSPTPDMEELQPV 
1429 TGTATGTCAATGTGGGCTCTGTAGATGTGGATGTGGTTTATTCTCAGGTCT 

478 YVNVGSVDVDVVYSQVW 
1480 GGAGCATGCAGCAGCCAGAAAGCTCAGCAAACATCAGGACACTTCTGGAGA 

495 SMQQPESSANIRTLLEN 
1 531 ACAAGGACTCCCAAGTCATCTACTCTTCTGTGAAGAAATCATAACACTTGG 

512 KDSQVIYSSVKKS 

1582 AGGAATCAGAAGGGAAGATCAACAGCAAGGATGGGGCATCATTAAGACTTG 
1633 CTATAAAACCTTATGAAAATGCTTGAGGCTTATCACCTGCCACAGCCAGAA 
1684 CGTGCCTCAGGAGGCACCTCCTGTCATTTTTGTCCTGATGATGTTTCTTCT 
1735 CCAATATCTTCTTTTACCTATCAATATTCATTGAACTGCTGCTACATCCAG 
1786 ACACTGTGCAAATAAATTATTTCTGCTACCTTCTCTTAAGCAATCAGTGTG 
1837 TAAAGATTTGAGGGAAGAATGAATAAGAGATACAAGGTCTCACCTTCATCT 
1888 ACTGTGAAGTGATGAGAACAGGACTTGATAGTGGTGTATTAACTTATT'TAT 
193 9 GTGCTGCTGGATACAGTTrGCTAATATTTTGTTGAGAATTTTTGC AAATAT 
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1990 GTTCATTGGGAATATTGGCCTGAAATTTTCTTTTCCACTGTGTCTCTGCCA 
2041 GAATGTTTGTATCAGGCTGATGCTGGCTTCATAGAATGAGTTAGGCAGGAG 
2092 CCCTTCCTCCTTGATTTTTTGGCATAGTTTCAGCAGGATTGGTACCAGTTA 
2143 TTCTTTCTGCATCTTGTAGAATTCAGCTATGAATCCATCTGGTCTAGGGCT 
2194 TTTGTGTTGGTTGGTAAGTTTTTTATTACTAATTCAACTTCAGCGCTTGAT 
2245 ATTGGTCTAGGAGGGGTTTCTGTCTCTTCCTGGTTCAATCTTGGGAG ATTG 
2296 TGTGTTTCCAGGAATTTAGCCGTTTCCTCCAGATTTTCTTCTTTATGTGCA 
2347 TCGACTTGAGTGTAAACATAACTTATATGCACTGGGAAACCAAAAAATCTG 
23 98 TGTGACTTGCTTTATTGCAGCATTTGTTTTATTTTGGTAGTCTGGAACTGA 
2449 ACCTGCAATATCACCAAAGTATGCATATAGTTGCAAAAATGTGATTTTTGA 
2500 CATAGTAAATATGAGTATTTGCAATAAACTATGATATTACTTTTGTAAGTA 
2551 TATAGAATAAAATGTAAATAATCTATAAAA 
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1 GAGGCATCTCT AGGT ACCATCCCTG ACCTGGTC CTC 

3 7 ATGCTGCCGAGGCTGTTGCTGTTGATCTGTGCTCCACTCTGTGAA 
MLPRLLLLICAPLCE 

PAELFLIASPSHPTE 
127 GGGAGCCCAGTGACCCTGACGTGTAAGATGCCCTTTCTACAGAGT 

GSPVTLTCKMPFLQS 
172 TCAGATGCCCAGTTCCAGTTCTGCTTTTTCAGAGACACCCGGGCC 

SDAQFQFCFFRDTRA 
217 TTGGGCCCAGGCTGGAGCAGCTCCCCCAAGCTCCAGATCGCTGCC 

LGPGWSSSPKLQIAA 
262 ATGTGGAAAGAAGACACAGGGTCATACTGGTGCGAGGCACAGACA 

MWKEDTGSYWCEAQT 
3 07 ATGGCGTCCAAAGTCTTGAGGAGCAGGAGATCCCAGATAAATGTG 

MASKVLRSRRSQINV 
3 52 CACAGGGTCCCTGTCGCTGATGTGAGCTTGGAGACTCAGCCCCCA 

HRVPVADVSLiETQPP 
3 97 GGAGGACAGGTGATGGAGGGAGACAGGCTGGTCCTCATCTGCTCA 

GGQVMEGDRLVLICS 
442 GTTGCTATGGGCACAGGAGACATCACCTTCCTTTGGTACAAAGGG 

VAMGTGDITFLWYKG 
487 GCTGTAGGTTTAAACCTTCAGTCAAAGACCCAGCGTTCACTGACA 

AVGLNLQSKTQRSLT 
532 GCAGAGTATGAGATTCCTTCAGTGAGGGAGAGTGATGCTGAGCAA 

AEYEIPSVRESDAEQ 
577 TATTACTGTGTAGCTGAAAATGGCTATGGTCCCAGCCCCAGTGGG 

YYCVAENGYGPSPSG 
622 CTGGTGAGCATCACTGTCAGAATCCCGGTGTCTCGCCCAATCCTC 

LVSITVRIPVSRPIL 
667 ATGCTCAGGGCTCCCAGGGCCCAGGCTGCAGTGGAGG ATGTGCTG 

MLRAPRAQAAVEDVL 
712 GAGCTTCACTGTGAGGCCCTGAGAGGCTCTCCTCCAATCCTGTAC 

ELHCEALRGSPPILY 

7 57 TGGTTTTATCACGAGGATATCACCCTGGGGAGCAGGTCGGCCCCC 

WFYHEDITLGSRSAP 

8 02 TCTGGAGGAGGAGCCTCCTTCAACCTTTCCCTGACTGAAGAACAT 

SGGGASFNLSLTEEH 
847 TCTGGAAACTACTCCTGTGAGGCCAACAATGGCCTGGGGGCCCAG 

SGNYSCEANNGLGAQ 
892 CGCAGTGAGGCGGTGACACTCAACTTCACAGTGCCTACTGGGGCC 

RSEAVTLNFTVPTGA 
937 AGAAGCAATCATCTTACCTCAGGAGTCATTGAGGGGCTGCTCAGC 

RSNHLTSGVIEGLLS 

9 82 ACCCTTGGTCCAGCCACCGTGGCCTTATTATTTTGCTACGGCCTC 

TLGPATVALLFCYGL 
1027 AAAAGAAAAATAGGAAGACGTTCAGCCAGGGATCCACTCAGGAGC 

KRKIGRRSARDPLRS 
1072 CTTCCCAGCCCTCTACCCCAAGAGTTCACCTACCTCAACTCACCT 

LPSPLPQEFTYLNSP 
11 17 ACCCCAGGGCAGCTACAGCCTATATATGAAAATGTGAATGTTGTA 

TPGQLQPIYENVNVV 
1162 AGTGGGGATGAGGTTTATTCACTGGCGTACTATAACCAGCCGGAG 

SGDEVYSLAYYNQPE 
1207 CAGGAATCAGTAGCAGCAGAAACCCTGGGGACACATATGGAGGAC 

QESVAAETLGTHMED 
1252 AAGGTTTCCTTAGACATCTATTCCAGGCTGAGGAAAGCAAACATT 

KVSLDIYSRLRKANI 
1297 ACAGATGTGGACTATGAAGATGCTATGTAA 1326 

TDVDYEDAM* 

GGTT ATGGAAGATT CTGCTCTTTG 
13 51 AAAACCATCC ATGACCCCAA GCCTCAGGCC TGATATGTTC TTCAGAGATC 
1401 CTGGGGCATT AGCTTTCCAG TATACCTCTT CTGGATGCCA TTCTCCATGG 
1451 CACTATTCCT TCATCTACTG TGAAGTGAAG TTGGCGCAGC CCTGAAGAAA 
1501 CTACCTAGGA GAACTAATAG ACACAGGAGT GACAGGGACT TTGTTATCAG 
1551 AACCAGATTC CTGCCGGCTC CTTTGAAAAC AGGTCATATT GTGCTCTTCT 
1601 GTTTACAAGA GGAAACAAGA TGGAATAAAA GAAATTGGGA TCTTGGGTTG 
1651 GAGGGACAGT GAAGCTTAGA GCACATGAAC TCAAGGTTAG TGACTCTGCA 
1701 GGACTTCACA GAGAGAGCTG TGCCCATCAT TCAGTCCAAG TGCTTTCTCT 
1751 GCCCAGACAG CACAGAACTC CAGCCCCGCT ACTTACATGG ATCATCGAGT 
1801 TTCCACCTAA AATATGATTC TATTTATTTT GAGTCACTGT TACCAAATTA 
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1851 GAACTAAAAC AAAGTTACAT 

1901 GTGACGTATT TTTGTATATA 

1951 GTATCTATTA CAGCCCCTAG 

2001 ATTCACAAAA TTTTTGAAAT 

2051 AATTATTTTT TTTTTAAATT 

2101 GGAATGCAGT GGCACAATCT 

2151 CAAGCAAACC TCTCACCTCA 

2201 TGCCACCAAA CTTGGCCATT 

2251 GTTTTGCCCA GGCTGGTCTC 

2301 TTT 



AAAAAGTTAT TGTGACTCCA CTTAATTTTA 
TAGGCCAACC TATACCACAT CCAAAATTAT 
AAGCTTTATA AATACAGTGT GTCTTCTTTT 
CGTGG TAATA TGGTTTGAAA CCTGTATCTT 
GAGACAGGGT CTCACTCTGT CACTCAATCT 
TGCCTCACTG CAACGCCTGC CTCTCAGGCT 
GCCTGCTGAG TAGCTGGGAC TACAGGCACA 
TTTTGTCTTA CGTAGAGACA AGATTTCACC 
AAACTCCTGG GCTCAAGCAA TGTATTGAAT 
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